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PESTICIDAL AGENTS 

The present invention relates to materials, agents and 
5 compositions having pesticidal activity which derive from 
bacteria, and more particularly from Xenorhabdus species. 
The invention further relates to organisms and methods 
employing such compounds and compositions. 

There is an ongoing requirement for materials, agents, 
compositions and organisms having pesticidal activity, 
for instance for use in crop protection or insect- 
mediated disease control. Novel materials are required 
to overcome the problem of resistence to existing 
pesticides. Ideally such materials are cheap to produce, 
stable, have a high toxicity (either when used alone or 
in combination) and are effective when taken orally by 
the pest target. Thus any invention which provided 
materials, agents, compositions or organisms in which any 
of these properties was enhanced would represent a step 
forward in the art. 

Xenorhabdus spp. in nature are frequently symbiotically 
associated with a nematode host, and it is known that 
25 this association may be used to control pest activity. 
For instance, it is known that certain Xenorhabdus spp. 
alone are capable of killing an insect host when injected 
into the host's hemocoel. 

30 in addition, one extracellular insecticidal toxin from 
Photorhabdus luminescens has been isolated (this species 
was recently removed from the genus Xenorhabdus, and is 
closely related to the species therein). This toxin 4 is 
not effective when ingested, but is highly toxic when 

35 injected into certain insect larvae (see Parasites and 
Pathogens of Insects Vol, 2, Eds, Beckage, N. E. et 
al., Academic Press 1993). 



15 
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Also known are certain low-molecular weight heterocyclic 
compounds from P. luminescens and X.nematophilus which 
have antibiotic properties when applied intravenously or 
topically (see Rhodes, S.H. et al., PCT WO 84/01775). 

5 

Unfortunately none of these prior art materials have the 
ideal pesticide characteristics discussed above, and in 
particular, they do not have toxic activity when 
administered orally. 

10 

The present invention provides pesticidal agents and 
compositions from Xenorhabdus species, organisms which 
produce such compounds and compositions, and methods 
which employ these agents, compositions and organisms, 
15 that alleviate some of the problems with the prior art. 

According to one aspect of the present invention there is 
disclosed a method of killing or controlling insect pests 
comprising administering cells from Xenorhabdus species 
20 or pesticidal materials derived or obtainable therefrom, 
orally to the pests. 

A PCT application of CSIRO published as WO 95/00647 
discloses an apparently toxic protein from Xenorhabdus 
25 nematophilus; however no details of the protein's 

toxicity are given, and certainly there is no disclosure 
of its use as an oral insecticide. 

Thus the invention provides an insecticidal composition 
30 adapted for oral administration to an insect, which 
composition comprises a pesticidal material obtainable 
from a Xenorhabdus species, or a pesticidal fragment 
thereof, or a pesticidal variant or derivative of either 
of these. 

35 

The composition may in fact comprise cells of Xenorhabdus 
or alt rnatively supernatant taken from cultures of cells 
of Xenorhabdus species. However, the composition 
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preferably comprises toxins isolable from Xenorhabdus as 
illustrated hereinafter. Toxic activity has b en 
associated with material encoded by the nucleotide 
sequence of Figure 2. Thus, the composition suitably 
5 comprises a pesticidal material which is encoded by all 
or part of the nucleotide sequence of Figure 2. 
Pesticidal fragments as well as variants or derivatives 
of Buch toxins may also be employed. 

10 The sequence of Figure 2 is of the order of 40kb in 

length. It is believed that this sequence may encode more 
than one protein, each of which may regulate or be 
insecticidal either alone or when presented together. It 
is a matter of routine to determine which parts are 

15 necessary or sufficient for inBecticidal activity. 

As used herein the term ^variant" refers to toxins which 
have modified amino acid sequence but which share similar 
activity. Certain amino acids may be replaced with 

20 different amino acids without altering the nature of the 
activity in a significant way. The replacement may be 
by way of ^conservative substitution" where an amino 
acid is replaced with an amino acid of broadly similar 
properties, or there may be some non-conservative 

25 substitutions. In general however, the variants will be 
at least 60% homologous to the native toxin, suitably at 
least 70% homologous and more preferably at least 90% 
homologous . 

30 The term ^ derivative 1 1 relates to toxins which have been 
modified for example by chemical or biological methods. 

These toxinB are novel, and they and the nucleic acids 
which encode them form a further aspect of the invention, 

35 

A preferred Xenorhabdus species is the bacteria 
X.nematophllus* Particular strains of X.nematophilus 
which are useful in the context of the invention are 
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ATTC 19061 strain, available from the National Collection 
of Industrial and Marine Bacteria, Aberdeen, Scotland 
(NCIMB) . In addition, suitable strains include two novel 
strains of Xenorhabdus which were deposited at the NCIMB 
5 on 10 July 1997 and were designated with repository 
numbers NCIMB 40886 and NCIMB 40B87. These latter 
strains form a further aspect of the invention. 

All strains have common characteristics as set out in the 
10 following Table 1. 





Table 


1 

Strains 




Characteristics 


ATCC 19061 


NCIMB 40887 


NCIMB 40886 


Gram strain 


negative 


negaHve 


'negative 


Shape/size 


rods up to 


rods up to 


rods up to 




4nm long 


4 jim long 


4nm long 


Motile 


Yes 


Yes 


Yes 


Bioiuminescent 


No 


No 


No 


Colour on NBTA* 


blue 


blue 


blue 


insecticidal on 








ingestion by 


yes 


yes 


yes 


insects 








Production of 


yes 


yes 


yes 


Antibiotics 








Resistant to 








ampicillin 


yes 


yes 


yes 


(50jig/ml) 








colony 


circular 


circular 


circular 


morphology/ 


convex 


convex 


convex 


colour 


cream 


cream 


cream 



15 *NBTA (Oxoid nutrient agar containing 0.0025% 
bromothymol blue and 0.004% tetrazolium chloride) 

Preferably the pest target is an insect, and more 
preferably it is of the order Lepidoptera, particularly 
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Pieris brassicae, Pieris rapae, or Plutella xylostella or 
the order Diptera, particularly Culex quinquefaciatus . 

In a preferred embodiment of the invention, cells from 
5 Xenorhabdus species or agents derived therefrom are used 
in conjunction with Bacillus thuringiensis as an oral 
pesticide . 

In further embodiments, rather than using Bacillus 
10 thuringiensis itself, pesticidal materials obtainable 
from B. thuringiensis (e.g. delta endotoxins or other 
isolates) are used in conjunction with Xenorhabdus 
species . 

15 The term 'obtainable from* is intended to embrace not 

only materials which have been isolated directly from the 
bacterium in question, but also those which have been 
subsequently cloned into and produced by other organisms. 

20 Thus the unexpected discovery that bacteria of the genus 
Xenorhabdus (and materials derived therefrom) have 
pesticidal activity when ingested, and that such bacteria 
and materials can be used advantageously in conjunction 
with B . thuringiensis (and toxins or materials derived 

25 therefrom), forms the basis of a further aspect of the 
present invention. The pesticidal activity of 
B. thuringiensis isolates alone have been well documented. 
However, synergistic pesticidal activity between such 
isolates and bacteria of the Xenorhabdus species (or 

30 materials derived therefrom) has not previously been 
demonstrated. 

In still further embodiments of the invention, culture 
supernatant taken from cultures of Xenorhabdus species, 
35 particularly X. nematophilus, is used in place of cells 
from Xenorhabdus species in the methods above. 
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All of these methods can be employed, inter alia , in pest 
control. 

The invention also makes available pesticidal 
5 compositions comprising cells from Xenorhabdus species, 
preferably X.nematophilus, in combination with B. 
thuringiensis. As with the methods above, a pesticidal 
toxin from B.thuringiensis (preferably a delta endotoxin) 
may be used as an alternative to B.thuringiensis in the 
10 compositions of the present invention 

Likewise, culture supernatant taken from cultures of 
Xenorhabdus species, preferably, X.nematophilus may be 
used in place of cells from Xenorhabdus species. 

Such compositions can be employed, inter alia, for crop 
protection eg. by spraying crops, or for livestock 
protection. In addition, compositions of the invention 
may be used in vector control. 

The invention further encompasses novel pesticidal agents 
which can be isolated from Xenorhabdus spp. Techniques 
for isolating such agents would be understood by the 
skilled person. 

In particular, such techniques include the separation and 
identification of toxin proteins either at the protein . 
level or at the DNA level. 

30 The applicants have cloned and partially sequenced a 
region of DNA from Xenorhabdus NCIMB 40887 which region 
codes for insecticidal activity and thi6 is shown as 
Figure 2 (SEQ ID NO. 1) hereinafter. Thus in a preferred 
embodiment the invention also provides a toxin which is 

35 encoded by DNA of SEQ ID No. 1 or a variant or fragment 
thereof . 



20 
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The invention also provides a recombinant DNA which 
encodes such a toxin. The recombinant DNA of the 
invention may comprise the sequence of Figure 2 or a 
variant or fragment thereof. Other DNA sequences may 
5 encode similar proteins as a result of the degeneracy of 
the genetic code. All such sequences are encompassed by 
the invention. 

The sequence provided herein is sufficient to allow 
10 probes to be produced which can be used to identify and 
subsequently to extract DNA of toxin genes. This DNA may 
then be cloned into vectors and host cells as is 
understood in the art. 

15 DNA which comprises or hybridises with the sequence of 
Figure 2 under stringent conditions forms a further 
aspect of the invention. 

The expression ^hybridises with" means that the 
20 nucleotide sequence will anneal to all or part of the 
sequence of Figure 2 under stringent hybridisation 
conditions, for example those illustrated in "Molecular 
Cloning ,, , A Laboratory Manual" by Sambrook, Fritsch and 
Maniatis, Cold Spring Habor Laboratory Press, Cold Spring 
25 Harbor, N.Y. 

The length of the sequence used in any particular 
analytical technique will depend upon the nature of the 
technique, the degree of complementarity of the sequence, 

30 the nature of the sequence and particularly the GC 
content of the probe or primer and the particular 
hybridisation conditions employed. Under high 
stringency, only sequences which are completely 
complementary will bind but under low stringency 

35 conditions, sequences which are 60% homologous to the 

target sequence, more suitably 80% homologous, will bind. 
Both high and low stringency conditions are encompassed 
by the term "stringent conditions" used herein. 

SUBSTITUTE SHEET (RULE 26) 
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Suitable fragments of the DNA of Figure 2, i.e. those 
which encode pesticidal agents may be identified using 
standard techniques. For example, transposon 
5 mutagenesis techniques may be used, for example as 

described by H.S. Siefert et al., Proc. Natl. Acad. Sci. 
USA, (1986) 83, 735-739. Vectors such as the cosmid 
cHRIMI, can be mutated using a variety of transposons and 
then screened for loss of insectidal activity. In this 
10 way regions of DNA encoding proteins responsible for 
toxic activity can be identified. 

For example, the mini-transposon mTn3(HIS3) can be 
introduced into a toxic Xenorhabdus clone such as cHRIMI, 

15 hereinafter referred to as * clone 1 ' , by electroporating 
cHRIMI DNA into E.coli RDP146(pLB101 ) and mating this 
strain with E.coli RDP146 (pOX38) , followed by E. coli 
NS2114Sm. The final strain will contain cHRIMlDNA with a 
single insertion of the transposon mTn3(HIS3). These 

20 colonies can be cultured and tested for insecticidal 
activity as described in Example 8 hereinafter. 
Restriction mapping or DNA sequencing can be used to 
identify the insertion point of mTn3(HIS3) and hence the 
regions of DNA involved in toxicity. Similar approached 

25 can be U6ed with other transposons such as Tn5 and mTn£- 

Site directed mutagenesis of cHRIMI as outlined in 
"^Molecular Cloning, A Laboratory Manual" by Maniatis, 
Fritsch and Sambrook, (19B2) Cold Spring Harbor, can also 
30 be used to test the importance of specific regions of DNA 
for toxic activity. 

Alternatively, subcloning techniques can be used to 
identify regions of the cloned DNA which code for 
35 insecticidal activity. In this method, specific smaller 
fragments of the DNA are subcloned and the activity 
determined. To do this, cosmid DNA can be cut with a 
suitable restriction enzyme and ligated into a compatible 
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restriction site on a plasmid vector, such as pUC19. 
The ligation mix can be transformed into E. coli and 
transformed clones selected using a selection marker such 
as antibiotic resistance, which is coded for on the 
5 plasmid vector. Details of these techniques are 

described for example in Maniatis et al, supra, (see 
p390-391) and Methods in Molecular Biology, by L.G. 
Davies, M.D. Dibner and J.F. Battey, Elsevier, (see p222- 
224) . 

10 

Individual colonies containing specific cloned fragments 
can be cultured and tested for activity as described in 
Example 8 hereinafter. Subclones with insecticidal 
activity can be further truncated using the same 
15 methodology to further identify regions of the DNA coding 
for activity. 

The invention also discloses an isolated pesticidal agent 
characterised in that the agent is obtainable from 

20 cultures of X. nematophilus or variants thereof, has oral 
pesticidal activity against Pieris brassicae, Pieris 
rapae and Plutella xylostella, is substantially heat 
stable to 55°C, is proteinaceous , acts synergistically 
with B.thuringiensis cells as an oral pesticide and is 

25 substantially resistant to proteolysis by trypsin and 
proteinase K. 

By 'substantially heat stable to 55°C is meant that the 
agent retains some pesticidal activity when tested after 
30 heating the agent in suspension to 55°C for 10 minutes, 
and preferably retains at least 50% of the untreated 
activity. 

By 'substantially resistant to proteolysis' is meant that 
35 the agent retains some pesticidal activity when exposed 
to proteaseB at 30°C for 2 hours and preferably retains 
at least 50% of the untreated activity. 



SUBSTITUTE SHEET (RULE 26) 



•I 



WO 98/08388 



PCT/GB97/02284 



10 

By 'acts synergistically 1 is meant that the activity of 
the combination of components iB greater than one might 
expect from the use of the components individually. For 
example, when used in conjunction with B.thuringiensis 

5 cells as an oral pesticide, the concentration of B. 
thuringieneis cellular material necessary to give 50% 
mortality in a P.brassicae when used alone is reduced by 
at least 80% when it is used in combination the agent at 
a concentration sufficient to give 25% mortality when the 

10 agent is used alone. 

It has been found that the activity of the material is 
retained by 30 kDa cut-off filters but is only partly 
retained by 100 kDa filters. 

15 

Preferably the agent is still further characterised in 
that the pesticidal activity is lost through treatment at 
25°C with sodium dodecyl sulphate (SDS - 0.1% 60 mins) 
and acetone (50%, 60 mins). 

20 

Clearly the characterising properties of the isolated ' 
agent described above can be utilised to purify it from, 
or enrich its concentration in, Xenorhabdus species cells 
and culture medium supernatants . Methods of purifying 

25 proteins from heterogenous mixtures are well known in the 
art (eg. ammonium sulphate precipitation, proteolysis, 
ultrafiltration with known molecular weight cut-off 
filters, ion-exchange chromatography, gel filtration, 
etc.). The oral pesticidal activity provides a 

30 convenient method of assaying the level of agent after 
each stage, or in each sample of eluent. Such 
methodology does not require inventive endeavour by those 
skilled in the art. 

35 The invention further discloses oral pesticidal 

compositions comprising one or more agents as described 
above. Such compositions preferably further comprise 
other p sticidal materials from non-Xenorhabdus species. 
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These other materials may be chosen such as to have 
complementary properties to the agents described above, 
or act synergistically with it. 

5 Preferably the oral pesticidal composition comprises one 
or more pesticidal agents as described above in 
combination with B. thuringxensis (or with a toxin 
derived therefrom, preferably endotoxin). 

10 Recombinant DNA encoding said proteins also forms a 
further aspect of the invention. The DNA may be 
incorporated into an expression vector under the 
influence of suitable control elements such as promoters, 
enhancers, signal sequences etc. as is understood in the 

15 art. These expression vectors form a further aspect of 
the invention. They may be used to transform a host 
organism so as to ensure that the organism produces the 
toxin . 

20 The invention further makes available a host organism 
comprising a nucleotide sequence coding for a pesticial 
agent as described above. 

Methods of cloning the sequence for a characterised 
25 protein into a host organism are well known in the art. 
For instance the protein may be purified and sequenced: 
as activity is not required for sequencing, SDS gel 
electrophoresis followed by blotting of the gel may be 
used to purify the protein. The protein sequence can be 
30 used to generate a nucleotide probe which can itself be 
used to identify suitable genomic fragments from a 
Xenorhabdus gene library. These fragments can then be 
inserted via a suitable vector into a host organism which 
can express the protein. The use of such general 
35 methodology is routine and non-inventive to those skilled 
in the art* Such techniques may be applied to the 
production of Xenorhabdus toxins other than those encoded 
by the sequence of Figure 2. 

SUBSTITUTE SHEET (RULE 26) 
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It may be desirable to manipulate (eg. mutate) the agent 
by altering its gene sequence (and hence protein 
structure) such as to optimise its physical or 
5 toxicoiogical properties. 

It may also be desirable for the host to be engineered or 
selected such that it also expresses other proteinaceous 
pesticidal materials (eg. delta- endotoxin from fi. 
10 thuringiensis) . Equally it may be desirable to generate 
host organisms which express fusion proteinB composed of 
the active portion of the agent plus these other toxicity 
enhancing materials. 

15 a host may be selected for the purposes of generating 

large quantities of pesticidal materials for purification 
e.g. by using B. thuringiensis transformed with the agent- 
coding gene. Preferably however the host is a plant, 
which would thereby gain improved pest-resistance. 

20 Suitable plant vectors, eg. the Ti plasmid from 

Agrobacterium tumefaciens, are well known in the art. 
Alternatively the host may be selected such as to be 
directly pathogenic to pests, eg. an insect baculovirus. 

25 The teaching and scope of the present invention embraces 
all of these host organisms plus the agents, mutated 
agents or agent-fusion materials which they express. 

Thus the invention makes available methods, compositions, 
30 agents and organisms having industrially applicable 
pesticidal activity, being particularly suited to 
improved crop protection or insect-mediated disease 
control . 

35 The methods, compositions and agents of the present 

invention will now be described, by way of illustration 
only, through reference to the following non-limiting 
examples and figures. Other embodiments falling within 
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th scope of the invention will occur to those skilled in 
th art in the light of these. 

FIGURE 

5 Figure 1 shows the variation with time of the growth of 
X. nematophilus ATCC 19061 and activity of cells and 
supernatants against P. brassicae as described in Example 
3. 

10 Figure 2 shows the sequence of a major part of a cloned 
toxin gene from Xenorhabdus. 

Figure 3 shows a comparison of the restriction maps of 
cloned toxin genes from two strains of Xenorhabdus 
15 (clone 1 above and clone 3 below). 



EXAMPLES 

20 

Example 1 - Use of X. nematophilus cells as an oral 
insecticide 

CELL GROWTH: A subculture of X. nematophilus (ATCC 19061, 
25 Strain 9965 available from the National Collections of 
Industrial and Marine Bacteria, Aberdeen, Scotland) was 
used to inoculate 250 ml Erlenmeyer flasks each 
containing 50 ml of Luria Broth containing lOg tryptone, 
5g yeast extract and 5g NaCl per litre. Cultures were 
30 grown in the flasks at 27 °C for 40hrs on a rotary shaker. 

PRODUCTION OF CELL SUSPENSION: Cultures were centrifuged 
at 5000 x g for 10 mins. The supernatants were discarded 
and the cell pellets washed once and resuspended in an 
35 equal volume of phosphate buffered saline (8g NaCl, 1.44g 
Na2HP0 4 and 0.24g of KH2PO4 per litre) at pH 7.4. 
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ACTIVITY OF CELL SUSPENSION TO INSECTS: The bioassays 
were as follows: P. brassicae: The larvae were allowed 
to feed on an artificial agar-based diet (as described by 
David and Gardiner (1965) London Nature, 207, 882-883) 

5 into which a series of dilutions of cell suspension had 
been incorporated. The bioassays were performed using a 
series of 5 doses with a minimum of 25 larvae per dose. 
Untreated and heat-treated (55°C for 10 minutes) cells 
were tested. Mortality was recorded after 2 and 4 days 

10 with the temperature maintained at 25°C. 



LC50 cells/g diet 

Treatment 2. days 4 tiayp 

Untreated 5.9 x 10 5 9.8 x 10 4 

15 Treated 55°C 7.1 x 10 5 1.4 x 10 5 



Aedes aegypti: The larva were exposed to a series of 5 
different dilutions of cell suspension in deionised 
water. The biosassays were performed using 2 doses per 
20 dilution of 50 ml cell suspension in 9.5cm plastic cups 
with 25 second instar larvae per dose. Untreated and 
heat-treated (55°C or 80°C for 10 minutes) cells were 
tested. Mortality was recorded after 2 days with the 
temperature maintained at 25 °C. 



25 



LC50 cells/ml 

2 flags 

Untreated 5.1 x 10 6 



Treatment 2. days 



Treated 55 °C 7.4 x 10 6 

30 Treated 80°C > 10 8 



Culex quinquefaciatus : The larvae were exposed to a 
single concentration cell suspension containing 4 xlO 7 
cells/ml * The biosassays were performed using 2 50 ml 
35 cell suspensions in 9.5 cm plastic cups with 25 second 
instar larvae per cup. Untreated and heat-treated (55°C 
or 80 °C for 10 minutes) cell6 were t sted. Mortality was 



SUBSTITUTE SHEET (RULE 26) 



WO 98/08388 



PCT/GB97/02284 



15 

recorded after 2 days with the temperature maintained at 
25°C. 



% Mortality 
Treatment 2 days 

untreated 100 
Treated 55°C 100 
Treated 80 °C 0 



10 Thus these results clearly show that cells from X. 

nematophilus are effective as an oral insecticide against 
a number of insect species (and are particularly potent 
against P. brassicae) . The insecticidal activity is not 
dependent on cell viability (i.e is largely unaffected by 

15 heating to 55°C which reduces cell viability by >99.99%) 
but is much reduced by heating to 80°C, which denatures 
most proteins . 



Example 2 - Use of X. nematophilus supernatant as an oral 
20 insecticide 



CELL GROWTH: Cultures were grown as in Example 1. 

PRODUCTION OF SUPERNATANT: Cultures were centrifuged 
25 twice at lOOOOg for 10 mins. The cell pellets were 
discarded. 



ACTIVITY OF SUPERNATANT TO INSECTS: The Bioassay was as 
follows: 

30 Activity against neonate P. brassicae and two day old 
Pieris rapae and Plutella xylostella larvae was measured 
as for P. brassicae in Example 1, but using a series of 
untreated dilutions of supernatant in place of of cell 
supensions and with mortality being recorded after 4 dayB 

35 only. 
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LC50 (pi supernatant/g diet) 
Insect species 4 days 

P. brassicae 22 
5 p. rapae 79 
P. xylostella 135 



In addition, size-reducing activity (62% reduction in 7 
days) against Mamestra brassicae was detected in larvae 
10 fed on an artificial diet containing x. nematophilus 
supernatant (results not shown). 



Thus these results clearly show that the supernatant from 
X. nematophilus culture medium is effective as an oral 
15 insecticide against a number of insect species, and are 
particularly potent against P. brassicae. 

The heating of supernatants to 55°C for 10 minutes caused 
a partial loss of activity while 80 °C caused complete 
loiss of activity. Activity was also completely lost by 
treatment with SDS (0.1%w/v for 60 mins) and Acetone (50% 
v/v for 60 mins) but was unaffected by Triton X-100 (0.1% 
60 mins), non-diet P40 (0.1% 60 mins), NaCl (1 M for 60 
mins) or cold storage at 4°C or -20°C for 2 weeks. All 
of these properties are consistent with a proteinaceous 
agent . 

The general mode of action of X. nematophilus cells and 
supernatants i.e. reduction in larval size and death 
30 within 2 days at high dosages, and other properties, eg. 
temperature resistence, appear to be similar suggesting a 
single agent or type of agent may be responsible for the 
oral insecticide activity activities of both cells and 
supernatants. 

35 

Example 3 - Timescale for appearance of ingestable 
insecticidal activity 



20 



25 



SUBSTITUTE SHEET (RULE 26) 



WO 98/08388 



PCT/GB97/02284 



17 

CELL GROWTH: 1ml of an overnight culture of X. 
nematophilus was used to inoculate an Erlenmeyer flask. 
Cells were then cultured as in Example 1 . Growth was 
estimated by measuring the optical density at 600 nm. 

5 

PRODUCTION OF CELL SUSPENSION AND SUPERNATANTS: These 
were produced as in Examples 1 and 2 . 

ACTIVITY OF CELLS AND SUPERNATANTS AGAINST P. BRASSICAE : 
10 The cell suspension bioassay was carried out as in 
Example 1, but using a single dose of suspended cells 
equivalent to 50 pi of broth/g diet and measuring 
mortality after 2 days. The cell supernatant bioassay 
was carried out as in Example 2, but using a single dose 
15 equivalent to 50 pi supernatant/g diet (i.e. more than 
twice the LC50) and measuring mortality after 2 days. 

The results are shown in Fig. 1. Thus these results 
clearly show that cells taken from X. nematophilus 

20 culture medium are highly effective as an oral 

insecticide against P. brassicae after only 5 hours, and 
supernatants are highly effective after 20 hours. 
Although some slight cell lysis was observed in the early 
stages of growth, no significant cell lysis was observed 

25 after this point demonstrating that the supernatant 

activity may be due to an authentic extracellular agent 
(as opposed to one released only after cell breakdown). 

Example 4 - Synergy between X. nematophilus cells and 
30 B.thurinqiensis powder preparations 

CELL GROWTH AND SUSPENSION: X. nematophilus cells were 
grown and suspended as in Example 1. B . thuringiensis 
strain HD1 (from Bacillus Genetic Stock Centre, The Ohio 
35 State University, Columbus, Ohio 43210, USA) was 
cultured, harvested and formulated into a powder as 
d scribed by Dulmage et al. (1970) J. Invertebrate 
Pathology 15, 15-20. 
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ACTIVITY OF X. NEMATOPHILUS CELLS AND B. THURINGI ENSIS 
POWDER AGAINST P. BRASS ICAE : The bioassays was carried 
out using X. nematophilus and B. thuringiensis in 

5 combination or using B. thuringiensis cell powder alone. 
Bioassays were carried out as in Example 1 but with 
various dilutions of B. thuringiensis powder in place of 
X. nematophilus. For the combination experiment, a 
constant dose of X. nematophilus cell suspension 

10 sufficient to give 25% mortaility was also added to the 
diet. Mortality was recorded after 2 days. 



These results clearly demonstrate the synergism between 
X. nematophilus cells and B. thuringiensis powder when 
20 acting as an oral insecticide against P. brassicae. 

Example 5 - Synergy between of X* nematophilus 
supernatants and B. thuringiensis powder 

25 CELL GROWTH AND PRODUCTION OF SUPERNATANTS : X. 

nematophilus cellB were grown and supernatants prepared 
as in Example 2. B. thuringiensis was grown and treated 
as in Example 4. 

30 ACTIVITY OF X. NEMATOPHILUS SUPERNATANT S AND Bt CELL 
POWDER AGAINST P. BRASS ICAE: 

The bioassays were carried out using X. nematophilus 
supernatants and B. thuringiensis in combination or using 
fl. thuringiensis powder alone. The Bioassay against 
35 neonate P. brassicae and two day old Pieris rapae and 
Plutella xylostella larvae were measured as in Example 2 
but with various dilutions of B . thuringiensis in place 
of X. nematophilus. For the combination experiment f a 



EiQflfiSfly 

15 B.t. alone 

B.t. plus X. nematophilus 



LC50 (yg Bt powder/g diet) 
2 4aye 
1.7 



0.09 
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constant dose of X. nematophilus supernatant sufficient 
to give 25% mortality was also added to the diet. 
Mortality was recorded after 4 days. 



5 LC50 (yg Bt powder/g) 

diet 

insect aafi&i&fi Bt alone Bt alufi Xn 

P. brassicae 1.4 0.12 

P. rapae 2.5 0.26 

10 P. xylostella 7.2 0.63 



These results clearly demonstrate the synergism between 
X. nematophilus supernatants and B.thuringiensis powder 
when acting as an oral insecticide against several insect 
15 species. The fact that both X. nematophilus cells and 
supernatants demonstrate this synergism strongly suggests 
that a single agent or type of agent is responsible for 
the demonstrated activities. 



20 Example 5 - Characterisation of insecticidal agent from 
X. nematophilus supernatant by proteolysis 

CELL GROWTH AND PRODUCTION OF SUPERNATANTS: X. 
nematophilus cells were grown and supernatants prepared 
25 as in Example 2 . 



PROTEOLYSIS OF SUPERNATANT: Culture supernatant (50ml) 
was dialysed against 0.5 M NaCl (3x11) for 48 hours at 
4°C. The volume of the supernatant in the dialysis tube 

30 was reduced five-fold by covering with polyethylene 

glycol 8000 (Sigma chemicals). Samples were removed and 
treated with either trypsin (Sigma T8253 = 10,000 
units/mg) or proteinase K (Sigma P0390 = 10 units/mg) at 
a concentration of 0.1 mg protease/ml sample for 2 hours 

35 at 30°C. 



ACTIVITY OF PROTEASE TREATED SUPERNATANT AGAINST P. 
BRASSICAE: The boassay against neonate P. brassicae 
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larvae was carried out by spreading 25 of each 
•treatment' on the artificial agar-based diet referred to 
in Example 1 in a 4 . 5 cm diameter plastic pot. Four pots 
each containing 10 larvae were used for each treatment. 
5 Mortalities were recorded after 1 and 2 days. Controls 
using water only, trypsin (0.1 mg/ml) and proteinase K 
(0.1 mg/ml) were also tested in the same way. 





% Mortality 




Treatment 


1 day 


2 days 


Untreated supernatant 


60 


100 


Proteinase K treated supernatant 


45 


100 


Trypsin treated supernatant 


40 


100 


All controls (no supernatant) 


0 


0 



Example fi 

Entoroocidal activity of other Xenorhabdus 
Using the methodology of Examples 1 and 2, four different 
20 xenorhabdus strains were tested against insect pests. 
The results obtained were as follows: 
I) Activity to Pieris brassicae 

Strain deposit Cells 10 c /grm diet Supernatant LC50 



no/code' % mortality ul/gram of diet 

NCT^*"4l0887 100 OS 

0014 100 0.52 

0015 80 3.73 
NCIMB 40886 100 0.05 



25 it was found that entomocidal activity of cells and 

supernatant was reduced by more than 99% when all four 
strains were heated at 80°C for 10 minutes. 
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II) Activity to mosquitoes (Aedes aegypti) 
Bacteria added at the rate of 10 7 cells/ml of water 



Strain deposit 


Cells loVcjnn diet 


no/code 


% mortality 


NCIMB 40887 


0 


0014 


40 


0015 


45 


NCIMB 40886 


95 



5 Furthermore, all strains significantly reduced the growth 
of Heliothis virescens . 

Cloning of toxin genes from strains of Xenorhabdus 
Total cellular DNA was isolated from NCIMB 40887 and ATCC 
19061 using a Quiagen genomic purification DNA kit. 
Cells were grown in L borth (lOg tryptone, 5g yeast 
extract and 5g NaCl per 1) at 28°C with shaking (ISOrpm) 
to an optical density of 1.5 A6oo- Cultures were 
harvested by centrif ugation at 4000xg and resuspended in 
3.5mls of buffer Bl (50mM Tris/HCl, 0.05% Tween 20, 0.5% 
Triton X-100, pH7.0) and incubated for 30 mins at 50°C. 
DNA was isolated from bacterial lysates using Quiagen 
100/G tips as per manufacturers instructions. The 
resulting purified DNA waB stored at -2 0°C in TE buffer 
(lOmM Tris f ImM EDTA, pH 8.0). 

A representative DNA library was produced using total DNA 
of NCIMB 40887 and ATTC 19061 partially digested with the 

25 restriction enzyme Sau3a. Approximately 20jig of DNA from 

each strain was incubated at 37°C with 0.25 units of the 
enzyme. At time intervals of 10 , 20, 30, 45 and 60 
minutes, samples were withdrawn and heated at 65°C for 15 
minutes. To visualise the size of the DNA fragments, the 
30 samples were electrophoresed on 0.5% w/v agarose gels. 
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The DNA sampl s v/hich contained the highest proportion of 
30 to 50kb fragments were combined and treated with 4 
units of shrimp alkaline phosphatase (Boehringer) for 15 
minutes at 37°C, followed by heat treatment at 65°C to 
5 inactivate the phosphatase. 

The size selected DNA fragments were ligated into the 
BamHl site of the cosmid vector SuperCos I (Stratagent) 
and packaged into the Escherichia coli strain XL Blue 1, 
10 using a Gigapack II packaging kit (Stratgene) in 
accordance with the manufacturers instructions. 

To select for cosmid clones with entomocidal activity, 
individual colonies selected on L agar plates containing 

15 25p>g/ml ampicillin, were grown in L broth (containing 
25jig/ml ampicillin) overnight at 28°C. Broth cultures 
(SOjul) were individually spread onto the surface of 
insect diet contained in 4.5cm diameter pots, as 
described in Example 5. To each container 10 neonate P. 

20 brassicae larvae were added. Larvae were examined after 
24, 72 and 96 hours recording mortality and size of 
surviving larvae. A total of 220 clones of NCI MB 408B7 
were tested, of which two were found to cause reduction 
in larval growth and death within 72 hours. Of 370 

25 clones from ATTC 19061, one was found to cauBe larval 
death within 72 hours. 

Example 8 

Activity of cloned toxin genes to Pieris brassicae 
30 The three active clones from Example 7 were grown in L 
broth, containing 25|ig/ml ampicillin, for 24 hours at 
28°C, on a rotary shaker at 150rpm. The activity of the 
toxin clones to neonate larvae were performed by 
incorporation of whole broth cultures into insect diet, 
35 as described in Example 1. 
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Clone Np 



Strain 



LC5Q (M,l breth/g insert diet) 



1 



NCIMB 40887 



13.03 



2 



NCIMB 40887 



16.7 



3 



ATTC 19061 



108.7 



Control* 



No effect at 100^1/g 



*XL1 Blue B. coli broth 

5 

When E. coli toxin clones were heated at 60°C for 10 
minutes and added to the diet at a rate of 100nl/g, no 
activity to larvae was detected. Highlighting the heat 
sensitivity of the toxins. 

10 

Example 9 

Sequencing of the cloned toxin from NCIMB 40887 

Cosmid DNA of the entomocidal clone 1 above from NCIMB 

15 40887 was purified using the Wizard Plus SV DNA system 
(Promega) in accordance with the manufacturers 
instructions. A partial map of the cloned fragment was 
obtained using a range of restriction enzymes EcoRl, 
BamHl, Hindlll, Sail and Sacl as shown in Figure 3. DNA 

20 sequencing was intiatiated from pUC18 and pUC19 based 

sub-clones of the cosmid, using the enzymes EcoRl, BamHl, 
Hindlll, EcoRV and PvuII. Sequence gaps were filled 
using a primer walking approach on purified cosmid DNA. 
Sequence reactions were performed using the ABI PRIStf™ 

25 Dye Terminator Cycle Sequencing Ready Reaction Kit with 
AmmpliTaq DNA polymerase FS according to the 
manufacturers instructions. The samples were analysed on 
an ABI automated sequencer according to the manufacturers 
instructions. The major part of the DNA sequence for the 

30 cloned toxin fragment is shown in Figure 2. 
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Restriction mag a£ clpneti taxia from clQne 3 

Cosmid DNA of the entomocidal clone 3 above was purified 
5 as described in Example 9. A restriction map of the 
cloned fragment was obtained using the restriction 
enzymes BamHl, Hindi II, Sail and Sacl and this is shown 
in Figure 3, When compared with the map from clone 1 
(Figure 3) it is clear that over the regions which 
10 overlap, the restriction maps are very similar. The 
only detectable difference between the two clones was a 
reduction in size of two Hindi II fragments in clone 3, 
corresponding to the 11.4kb and 7.2kb Hindlll fragments 
in clone 1 by approximately 2Kb and 200bp respectively. 
15 These results indicate the overall relatedne66 of the DNA 
region coding for toxicity in the two bacterial strains. 

Example 11 

Southern Blot flybrjdjpfttipn Experiments 

20 A 10.3kb HamHl-Sall fragment of the DNA from clone 1 was 
used as a probe to hybidise to total Hindlll digested DNA 
of the Xenorhabdus strains ATCC 19061, NCIMB 40886 and 
NCIMB 40887. Hybridisation was performed with 20ng/ml of 
DIG labelled DNA probe at 65°C for 18 hours. Filters 

25 were washed prior to immunological detection twice for 5 
minutes with 2 x SSC (0.3M NaCI, 30mM sodium citrate, pH 
7,0)/0.1% <w/v) sodium dodecyl sulphate at room 
temperature, and twice for 15 minutes with 0.1 x SSC 
(15mM NaClm 1.5 mM sodium citrate, pH 7.0) plus 0.1% 

30 sodium dodecyl sulphate at 65°C, The probe was labelled 
and experiments performed in accordance with 
manufacturers instructions, using a non-radioactive DIG 
DNA labelling and detection kit (Boehringer ) . The probe 
hybridised to a Hindlll fragment of approximately 8kb in 

35 all three strains as well as an 11.4kb fragment in NCIMB 
40887 and an approximate 9kb fragment in both NCIMB 40866 
and ATCC 19061. These results show that strains NCIMB 
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40886 and A/TCC 19061 contain DNA with cloBe homology to 
the toxin gene of clone 1 above, confirming the 
similarity between the toxine produced by the three 
strains. 
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CLAIMS 

1. An insecticidal composition adapted for oral 
5 administration to an insect comprising a pesticidal 
material obtainable from a Xenorhabdus species, or a 
pesticidal fragment thereof, or a pesticidal variant or 
derivative of either of these. 

10 2. A composition according to claim 1 wherein the said 
pesticidal material comprises material encoded by the 
nucleotide sequence of Figure 2 or variant or fragment 
thereof, or a sequence which hybridises with said 
sequence. 

IS 

3. A composition according to claim 1 or claim 2 which 
comprises cells of Xenorhabdus. 

4. A composition as claimed in any one of the 

20 preceding claims which comprises supernatant taken from 
cultures of cells of Xenorhabdus species. 

5. A composition according to any one of the preceding 
claims wherein the Xenorhabdus species is Xenorhabdus 

25 nematophilus. 

6. A composition according to any one of claims 1 to 4 
wherein the Xenorhabdus species is ATCC 19061, NCIMB 
40886 or NCIMB 40887 . 

30 

7. A composition as claimed in any one of the preceding 
claims which comprises a further pesticidal material not 
obtainable from Xenorhabdus. 

35 8. A composition according to claim 7 wherein the said 
further pesticidal material comprises a material 
obtainable from B. thuringiensis. 
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9. A composition according to claim 8 which further 
comprises cells of fl. thuringiensis. 

10. A composition according to claim 8 wherein the 

5 pesticidal materials obtainable from B. thuringiensis 
comprises the delta endotoxin. 

11. A composition according to any one of the preceding 
claims which further comprises an agriculturally 

10 acceptable carrier. 

12. A composition according to claim 10 wherein the 
carrier comprises items of insect diet. 

15 13. A method for killing or controlling insect pests , 
which method comprises administering to a pest or the 
environment thereof a composition according to any one of 
the preceding claims. 

20 14. A method as claimed in claim 12 wherein the pests 
are insects from the order Lepidoptera or Diptera. 

15. A microorganism comprising Xenorhabdus strain NCIMB 
40886. 

25 

16. A microorganism comprising Xenorhabdus strain NCIMB 
40887, 

17. A pesticidal agent which comprises a a toxin 
30 comprising a protein which is encoded by DNA which 

includes SBQ ID No. 1 or a variant or fragment thereof. 

18. An isolated pesticidal agent characterised in that 
it is obtainable from cultures of X. nematophilus or 

35 mutants thereof, has oral pesticidal activity against 
Pieris brassicae, Pieris rapae and Plutella xylostella, 
is substantially heat stable to 55°C f is proteinaceous, 
acts synergistically with fl. thuringiensis cells as an 
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oral pesticide, and is substantially resistant to 
proteolysis by trypsin and proteinase K. 

19. An isolated pesticidal agent as claimed in claim 18 

5 further characterised in that the pesticidal activity is 
substantially destroyed by treatment with sodium dodecyl 
sulphate or acetone or heating to 80°C. 

20. An isolated pesticidal agent aB claimed in claim 18 

10 or claim 19 further characterised in that the agent is an 
extracellular protein. 

21. A recombinant DNA which encodes a pesticidal agent 
according to any one of claims 17 to 20. 

15 

22. A recombinant DNA of claim 21 which comprises the 
sequence of Figure 2 or a variant or fragment thereof. 

23. A recombinant DNA which comprises or hybridises 
20 under stringent conditions with all or part of the 

sequence of Figure 2, and which encodes a pesticidal 
material. 

24. An expression vector comprising a recombinant DNA 
25 according to any one of claims 21 to 23. 

25. A host organism which has been transformed with an 
expression vector according to claim 24. 

30 26. A host organism as claimed in claim 25 which has been 
engineered or selected such that it also expresses other 
pesticidal proteinaceous toxicity enhancing materials 

27. A host organism comprising a nucleotide sequence 
35 coding for a fusion protein comprising a pesticidally 
active portion of an agent as claimed in any one of 
claims 17 to 20 in combination with other pesticidal 
proteinaceous toxicity enhancing materials. 
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28. A host organism as claimed in claim 27 wherein the 
pesticidal toxicity enhancing materials comprise delta- 
endotoxin from B. thuringiensis. 

5 

29. A host organism as claimed in any one of claims 25 to 
289 wherein the host is a plant. 

30. A host organism as claimed in any one of claims 25 to 
10 28 wherein the host is a virus pathogenic to insects. 

31. A fusion protein as expressed by a host as claimed in 
claim 27. 

15 32. An pesticidal composition comprising one or more 
agents as claimed in any one of claims 17 to 20. 



6 
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Fig.2. 

1 TCCACAATTG CCGGAGAAAA TCAGTCGGGA ACTGCCGGTG ATTATTCGTC ACTTATTAAA 

61 CGAATTTGCC GACCAGAATA AGGCTAAAAA ACTGCTACAG GCGCAACGCG ACTCGAACGA 

121 AGCGTTAACG GTAAAGAGTC ATTCGGATCC GCTGTATCGC TTTTGTGGTT ATCTGGTGTC 

X81 TGTCAATGAT ATGACCGGAA TGAAGATGGG CAATAAAAAC ATTAGCCCAC GAGCACCGAG 

241 ATTGTACTTG TATCATGCCT ATCTCTCTTT TATGGAAGCG CACGGCITTG AACGTCCGTT 

301 AACACTGACT AAGTTTGGTG AATCCATCCC CAAGATTATG CTGGAATACC GGAAGGAGTA 

361 TCGAAAAGTG CGAACCAAGA AAGGCTA7TC CTATAACGTG GAATTATCGG AAGAGGCCGA 

421 AGAATGGCTA CCGTCAGTGC CTGAGTGTCG AGACTTTAAA TCACCTGTAT AAAACTTTGA 

481 GCTTTAAGTC TGCA CTCCAT ACACAACTTA AAATATCTAA TTGTATTTAA AAGAAAATAA 

541 TAGATGTATA GTTATTTTTT AACTATACAT AAGCTCTACA TGCTCTTCAT TCGTGTAAAA 

601 AATGGGTGAA CAGGTGATAC AGTCAGTGAA TATCATATTA ATTACCGTAA ACCCAGATGT 

661 AGCAAGGCTT TCAGGGAATT GTGCAGAGGG TGCATAACTG AGAGGGTGAA AAAGATTTTC 

721 AGGGGGGCTT ATGGCAGGTA AACAAAATCA GAAGCAAATA CCGTGCACAA TCTGGTTTTT 

781 ATTTTTT GGT ACTACCTCAA ATTAAAATGA TGTAATCATC TGATTTTATT TAAGAATAGA 

841 AGTTAATCAC AATTTCATTG ATGGACTTTC ATTCACACTG GTATAGATAA ATAATTCTGT 

901 TATATCCTGT TTCATTACGC ATTCATCAGG AGTGCTGTTA CAGGAGACAA GAATGTCACA 

961 CATCATTTAC TTGTCGTTAA AGGGCAAGAA GCAGGGTTTA ATTTCAGCGG GTTGTTCAAC 

1021 GCCTGAATCA ATTGGAAATC GCTATCAAAA AGGACGTGAA GATCAAATAC AGGTATTGAG 

1061 CCTGAATCAT TCGATGAGCC GTGACCAGAA TGTTAATCAT CAACCCGTCA GTTTTGTGAA 

1141 ACCCATTGAT AAATCCTCTC CCCTGTTTGC 7GGATGCCAG TTTTGTGCAT TACAGGACAA 

12 01 GCCAGATGGG ACAACTGGAG TTCTTTTATG AAATCAAGC7 GACCAGTGCC ACGATTGTGG 

126; ATATTTCCTA TAATTATCCG GCATTCAATC AA7GA7AA7G GTGCGATACC CCATGAAGTG 

1321 GTGATGCTCG ATTATAAGTC CATTTCATGC AACCACATCG CCGCAGGACT TCGGGCTACA 

1381 GCATACGCAA TTAGCCGGAA GTGAAGAAGC AAGCCGCTTT TATCTGGGGT CTCGAATGTT 

1441 AAGCCACTTA AGAAGCCGCT GGTTGAAGAA ACCCCGGTAA AACCCGCTAA ACATCATGCC 

1501 CGTTATCGTT GTGTGGATGA TGACGGCAAT CTTTTAA CCG AACGCAAGTA TCGGGTTTGC 

1561 CTGCCGGATG GTCAGATAAA AGAAGGAAAG ACTG A7AA AC AAGGTTACAC CCAATGGCAT 

1621 CTTACGGATG ACAAAAATAA ACTTGAATTT CATATTTTAA AGGATTAATA CCATGCCAGC 

1681 CTATACCGTT CAGACAAAAA TAGAATCCAA CGTACCTTGTT GAAAACCTGC TTTACGACTT 

1741 AACCATTTAT CGTAAGGA7G CAAAAGGAAA TTTCCATATC TTGCTTGATG TTTTTCAGGA 

1801 GAAACTACAG AGTAATTATG AAACACAACA GCA7A7CACG CAGGAAATAG ACGACGATGT 

1861 TTCTGTGATT TATATTATGC AAATTATG CT TCACCGCAAA CATGGCTCAA ATATATTTCC 

1921 GGCACTGCAA ACCCATTTTA AGAAAATGTA 7AC CTC7C GGT GAATTAACTT CCGGTAAAGC 

19 Bl CTGTTCGGAG AAAAAACGGG AAAATGCCTG TTATTTTGAA AGTACAGTTG AAACAAAACC 

2041 TGTCAGCGAC GGGGATAATA CCGTTGACTT AAA7A7CACT ATTCCTGAAC GACCTTTTAT 

2101 TGCCAAAGAA TATCCCATTG GTCACCCACA CGATCCATTT GAAAAAAGTA AAATTGAATC 

2161 ATAAATACAG GACAGGTTAT CGAAAAGAAT 77A7C CGGA7 CAAAATGGAG CAAGTTTATG 

2221 TCAGGGCGCG AGCACACTAT TTTAGCTGCG 77777AAGA7 GATTAT CTCT TAA TGTTC AG 

2281 TTTTAATAGT GTTTTTATCG AGTGAAATTT AATCGCACAG GCAATTCTTT AGACTTTTAT 

2341 AGAAAACTAA AGAATTAAAG AACAAGATTG ACATTTTAAG 7TCAAATAT7 AATCAAAGTA 

2401 TGCTCGCGCC CTGAGTTTAT GTGGCCCTGC CGCTTTTTTT TATTGCCTGC CAATAGATAG 

2461 ACCAGATATT TATGAGCAAG CGGCACGAGA ATTATGGCAA TATGGCCGAA CTAAAATTGG 

2521 TCAACTGGAA ATTAAGCCGG GTGAGGGTTG CCGACATCCT AAAGGTACTT TTTATAATCA 

2581 ATATGGTGAA AGAATATCTG GGTTAGATTG GCTGACAT7G GCAAGCCTAA GAGATTCAGA 

2641 AAATATGATG ATGAGGTTGA TGATGAAGTA GCTGGTATTA CAATGTGGGG AAAATTGACA 

2701 GAATGGTTTG AAAAATCAGG GTATGAAAAA GTATTTAGTA ATGTCGGCTT ATCCCATTCT 

2761 AATATAAATG ACATAGTAAC TCTTAGTGAT TACTATAACA AAGGATATCA TGTTGTTACT 

2821 TTGATTTCAG CAGGAATGTT ATCAGATTTT GGTGACATAG AAACATCAGG AAAAAATCAT 

2881 TGGATAGTTT GGG AAGGAGT AGTAGAAAAC TATGAGAAAG AAAATATCAC AAATAATTCA 

2941 GATCTGAATC AATATGTAAA TTTAAATCTG TTTTCATGGG GTAAAGTGGA ACATCAAATT 

3001 AAAAAAAACA AATCACTAGA TTATGTACTC AA CCATA TTT TTTGA GGGTT GGTTTTTAAA 

3061 CCAATGAAAT AACATGAAAA AAATATTAAT 7ATTTTTATT TTTTTACTTT ATGGTTGTGG 

3121 TAA7CCAACG CCAAAAGTTT TACCAAAATC AGAGTTTCT7 CCTGATGCAG TGATAAATGA 

3181 ACCA7ATCAG GCA7CAAT7A CCATCACAGG AGGTGCATTG AATGAAAAAA GCGTTTGGG7 

3241 AAAAA7TCA7 CC7AC7GGC7 CAGGAC7AAC ATGGAATCCA AAAGATAGTT CTTTCC7A7A 

3301 GGG7GGAAAA AAAGAAATAA GAAAAGA77A 7CATCA7A7A AATATAACAG GTACCCCAAA 

3361 GAAGAZAGAA TTGATAAAAA 77GAAGTGG7 AGGA7TTACA 7TGGG7ACAA 7G7ACGCACG 

3421 GAAAGAGTTC AC7ATAAA77 A7ACTATAAA AGTAAGGGAA 7AATTGTCAC 7ATCAGAA7G 

3481 G7GA77TAA7 TCGCCATT77 TATACT777G TATACTC7CT CAACATAATC AGGAT7C777 
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3541 CTTATTATTT TTCATGGTGC TAAAAACGTT TATTGCAAAA ATAA ATTAAG TTAATCAGAT 

3601 AAATTATCTG CATTACTGTT ATAATCGATA ACACGATAAC CTGACTTTCT GCCTG TTCTT 

3661 ATGAACTCGA AGATAATCCT TTCTGAGCCT GAACGAATCA CATTGCAACC ACTCGCTTTG 

3721 AATCACCCAC ACCGGGACAT TCGTACGCGA GGAACGGGTT TACTCATGCT TGCCAGAGGG 

37B1 AGCAAGCCGT CCCAGATCAC CGCTGAAATC GGATGCAGTC TCCGGGTTAT CTGTAATTGG 

3841 GTTCACATGT GGCACAGATA GCGGGATTAT TCGGCGGTCA TGCCGGAGGC CGGTATCTCG 

3901 CCATGACGCC TGACATGATT GCCACTGCGC TCGAAGCCGC CAGCGCAGAG TCCCTGACGT 

3961 GCGTCGAAGC CAGGCAGGGT TTCCCTGCCT TGTACGCTTG AAACGCTGGC GAATACCCTG 

4 021 AAAAAACAGG GGCTCCCCTA TAAACGCCCC CGCCTGTCGC TTAAAAAAAG CGCAATAAAA 

4081 CGGAGTTTGC TGAAAAATCC GCCTTGCTGA ATAAAATTAA GGCCGGAGCA CAGTCAGGAC 

414^ ATTACCGTCT GGTCTATTTT GAGTTCTGGG GGCGTTAAAT TACACGGATA ACACGCTGTT 

4201 TTACCAGACA ACGTCAGGCA GTATCACGCG AGATGACGTG ATTGATTTTT TAGAGCCGGT 

4261 GGCCAGACAA GGGACAACCG CCTGACATTT TTAGTGTTGG ATAAT GCGCG TATCCATCAC 

4321 GGGATAGAGG AAAAAATCAG AAATGGCGGG TGACGAGAAC ACAACCTGTT TTTATTCTAT 

4381 CTTCCCGCTT ACAGCCCAGA GCTGTATCTG ATTGAAATCG TCTGGAAACA GGCCAAATAC 

4441 GACTGGCGAC GTTTTATCAC CTGGACTCAG GA TACAAT GG AATATGAGGT AAATACTTTA 

4501 TTGAAAGGTT ATGGCGACCA ATTTGCAATT AACTTTTCTT GAGTACTTAG TAAGAATAGA 

4561 GTCAGTCGAG GTTTTTTCAT TTCGGGTCGT GGGGATGATA CTG AAAATTT GTTTGTAATC 

4621 TCTGAAAATT GCTGTTTCTG TGGCTACGTC TGTCTTTTGG GATATTGTTT CCATCAAGTC 

4681 TGTCAACATA CTGTTAAGTT AGATGTTGAT AAAAGAGACT GAATT ATAAT ACAAAACAAT 

4741 AAATCACTTG GACAATATTT TATTTCACAT GAGACATTAA GGTT GATTT T CCCAATCTGG 

4801 TCAGTTATAA CCGAATAAGG ATCTTGAAAA ATCATGGGAT CTTACTTTTA TCAAATGAAG 

4861 TTAACGTAAA AGTTGATAAA GAAAATTATT TAATTCTAAG TGCCGT7GGC ATAAATATTT 

4921 TGTGTTTTGT TAATGAATGA ATAACCAGGT AAGCTGGATT TTCATTTTTT AATTACTCGT 

4981 TACAATATGC TATTTATTTA TATAAAGAGT TTGTGCCCAT TTAACCAGTA AACAAATTTG 

5041 TTCAACCGTA ACTTAGCTTC ATCGACTTTT GGCCTCGCCT GGTCAGAATC TAGGGCCGTT 

5101 ATCCTATTTA TTTATGATAA ATAAAATTTA ATTATCTTTA ATAAG CTGA A TATGTGGATT 

5161 TGTGCTCAAT CTTGGATTCA AGTATGTATT CCTTTTGGTA CCCTGCTTTA TTTTAAGGCA 

5221 GATGAAGAGG ATGCCAACAT GACACAATAT CGATTACGAC TGTAACATTA AAGTCAGTTA 

5281 TAAATTTTAT GATTAAAATG AAATTTTAGT AGAAAATCGT ATTCTATTCC GCCATTTACA 

5341 ATAGCATCCT CTTTAATATC ATTAATCTCA GATAAAACAA ATAATTACAA TGTGAATAGA 

5401 ATAATGACTT ACAAAATAAG CACTAAATCT TCAGATGAAC TCTTAACTGA CAACACTATT 

5461 TTATAAAATA ATTGAGGTTA TTATGTATAG CACGGCTGTA TTACTCAATA AAATCAGTCC 

5521 CACTCGCGAC GGTCAGACGA TGACTCTTGC GGATCTGCAA TATTTATCCT TCAGTGAACT 

5581 GAGAAAAATC TTTGATGACC AGCTCAGTTG GGGAGAGGCT CGCCATCTCT ATCATGAAAC 

5641 TATAGAGCAG AAAAAAAATA ATCGCTTGCT GGAAGCGCGT ATTTTTACCC GTGCCAACCC 

5701 ACAATTATCC GGTGCTATCC GACTCGGTAT TGAACGAGAC AGCGTTTCAC GCAGTTATGA 

5761 TGAAATGTTT GGTGCCCGTT CTTCTTCCTT TGTGAAACCG GGTTCAGTGG CTTCCATGTT 

5821 TTCACCGGCT GGCTATCTCA CCGAATTGTA TCGTGAAGCG AAGGACTTAC ATTTTTCAAG 

5881 CTCTGCTTAT CATCTTGATA ATCGCCGTCC GGATCTGGCT GATCTGACTC TGAGCCAGAG 

5941 TAATATGGAT ACAGAAATTT CCACCCTGAC ACTGTCTAAC GAACTGTTGC TGGAGCTATT 

6001 ACCCGCAAGA CCGGAGGTGA TTCGGACGCA TTGATGGAGA GCCTGTCAAC TTACCGTCAG 

6061 GCCATTGATA CCCCTTACCA TCAGCCTTAC GAGACTATCC GTCAGGTCAT TATGACCCAT 

6121 GACAGTACAC TGTCAGCGCT GTCCCGTAAT CCTGAGGTGA TGGGGCAGGC GGAAGGGGCT 

6181 TCATTACTGG CGATTCTGGC CAATATTTCT CCAGAACTGT ATAACATTTT GACCGAAGAG 

6241 ATTACGGAAA AGAACGCTGA TGCTTTATTT GCGCAAAACT TCAGTGAAAA TATCACGCCC 

6301 GAAAATTTCG CGTCACAATC ATGGATAGCC AAGTATTATG GTCT TGAAC T TTCTGAGGTG 

6361 CAAAAATACC TCGGGATGTT GCAGAATGGC TATTCTGACA GCACCTCTGC TTATGTGGAT 

6421 AATATCTCAA CGGGTTTAGT GGTCAATAAT GAAAGTAAAC TCGAAGCTTA CAAAATAACA 

6481 CGTGTAAAAA CAGATGATTA TGATAAACAT GTAAATTACT TTGATCTGAT GTATGAAGGA 

6541 AATAATCAAT TCTTTATATG TGCTAATTTT AAGATATCGA GAGAATTTGG GGCGACTCTT 

6601 AGGAAAAACT CAGGGACAAG TGGCATTGTC GGCAGCCTTT CCGGTCCCCT GGTAGCCAAT 

6661 ACTAATTTCA AAAGCAATTA CTTAAGTAAC ATATCTGATA ATGAATACAG AAATGGCGTA 

6721 AAAATATATG CCTATCGCTA TACGTCTTCC ACCAGCGCCA CAAATCAGGG CGGCGGAATA 

6781 TTCACTTTTG AGTCTTATCC CCTGACTATA TTTGCGCTCA AACTGAATAA AGCCATTCGC 

6841 TTGTGCCTGA CTAGCGGGCT TTCACCGAAT GAACTGCAAA CTATCGTACG CAGTGACAAT 

6901 GCACAAGGCA TCATCAACGA CTCCGTTCTG ACCAAAGTTT TCTATACTCT GTTCTACAGT 

6961 CACCGTTATG CACTGAGCTT TGATGATGCA CAGGTACTGA ACGGATCGGT CATTAATCAA 

7021 TATGCCCGAC GATGACAGTG TCAGTCATTT TAACCGTCTC TTTAATACCC CGCCGCTGAA 

7081 AGGGAAAATC TTTGAAGCCG ACGGCAACAC GGTCAGCATT GATCCGGATG AAGAACAATC 

7141 TACCTTTGCC CGTTCAGCCC TGATGCGTGG TCTGGGGATC AACAGT GGTG AACTGTATCA 

7201 GTTAGGCAAA CTGGCGGGTG TATTGGACAC ACAAAATATC CTCACACTTT CTGTCCCTGT 

7261 TATATTTCA CTGTATCGCC TCACGTTACT GGCCCGTGCC CATCAGCTGA CGGTT AATGA 

7321 ACTGTGTATG CTTTATGGTT TTTCGCCGTT CAATGGCAAA ACAACGGCTT CTTTGTCTTC 
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CGGGGAGTTG 
CGGAAATCAC 
CACCGGAAAT 
AAAGTAGTGA 
ATCTGGCGTC 
GCGGCCTGAA 
AAACGACCCA 
CACTGCGTCT 
GTGCGAGAAG 
CCACCAGTGG 
AGCAGACACT 
ACGCAGGCCA 
CGTGTTGCAG 
CGCTGGTGAA 
GGGATAAGTG 
AGACGCTGGC 
CGAATATCCA 
TGATTGATAA 
GTATTCAGCT 
TGTCAACCCG 
GGGTGTCGCG 
AGACCCGGAT 
CGGTGGAAGA 
TGTCAGCGCT 
AACGCGGGAG 
GGGTGAACTG 
ATACAAGGAT 
AAAAAGAGGA 
TGAAACTGGC 
CAACGCAGGT 
CATCAGGCTT 
ACCCGGATTT 
CCTTCAAAAA 
TATCATTCAT 
TTTTGAAGTG 
GATGGAAAAC 
TACGCTACAT 
CAAACAAATC 
TTTATCATCG 
ACCGTTTTTA 
AGATTACACT 
CGA3TTTCCA 
AACCAGTGAT 
GATATTTAGT 
TGATGTCAAA 
TATTGCTTCC 
AATCGATGCT 
GACCAAAGCC 
GGTAAATTAT 
TATGCAGCTC 
CAGAGCAAAC 
TCCGTTGGGA 
TGGCGATGAG 
GCCTTATTAC 
TTATGCCGAA 
TACCTATGAC 
TGTATTAATT 
TATCAAGAAA 
GGATTTCAAT 
TGCTTCCAGC 
GTCTATAATC 
CGGCCGCTGG 
CGTCGCACAA 
ACTTATTCTG 
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TCACGGCTGG 

CACTGAAGCG 

CAGTAATCTG 

CCGGGAGCTT 

ACCAGATATG 

TATCGCCGGA 

ACTGGTTCAA 

CAGTGAAGCA 

CCAACCGCCG 

ATTAATGGGC 

CACGGGCGAC 

TGGGTTCCCG 

TGGATACATG 

TATCCGTTAC 

GCAGACGCTG 

GGATTATACC 

GCCAGAAGGG 

TCAGGTCTCT 

CTACATCAAC 

CCAGTTTTTT 

GCTGGTTTAT 

GAT GGAT GAA 

iGGCCTTTAAA 

ATCACCGACA 

AACCTGCCGG 

GCCGCCGATG 

GCAATACGTC 

AGTGGCGAAA 

GTTTCTGCGT 

GGAGGCGGTC 

TCAGGGCGAG 

TGGCGACAAC 

GATGGAGAAC 

ACTCAAGGCA 

CCTGCCTCGT 

GGGAATATTC 

AACGCCGCTT 

AGCGCCATGA 

CAAATACCGT 

TTAAAACGGA 

AGGCGTTTGA 

TTTTCTCCAA 

TTTAAAAAGT 

TCCTATCAAT 

ATTACGGTGG 

TTGCCGGCAA 

TCATCGTTGG 

AAAGACGGGC 

AATCCGGAAG 

GGGGTGTATC 

ACGGGCATTG 

GAAGGCTTCT 

CGGTGGTTTA 

AGCGGAATGT 

GGGTATTACA 

AACACTTGGG 

AACGATGCTG 

TACAAAGGAT 

A GTGC CAGCG 

GTTTGCTACA 

CCGCCGGCTA 

AAGAGACACT 

TATGACCCGA 

CGCGGCGATA 



TTATCTGGTT 

ATCTGGTTAT 

CTTAATACTC 

CAGGCTGAAA 

GCGCGGTATA 

TTTATGATGC 

TTCTGCCATG 

GAGCTTTCTG 

GACAACACAA 

TGGGAAATCC 

AGACTGGGCC 

CCGGCGTGAA 

TGGCATCAGC 

GTGACTGCAT 

GCAGAAAATA 

GCAGAGCGCC 

GTGTCCCTGC 

TCTGCCATAA 

CGGGCGCTGA 

ACCGACTGGA 

TATCCGGAAA 

CTGCTGGAAG 

ACTTACCTGA 

ACGTCAACAG 

AATATTACTG 

CCTGGAAAGA 

CGGTCATATT 

AATGGTACTG 

CATGATGGCA 

ACTGACAAAA 

GATACTCTGC 

AATAAAAATG 

ACAGCACTCA 

ACGACTTGGT 

TGAATATGGG 

CGCAGATAAC 

TCACTGTCAG 

AACTGACGGG 

TAAACATTAT 

AAAACTATAT 

TTCTAA CACC 

ACACAATTTT 

GCAGTTATGC 

CATCCGGCTG 

TAG CTGGCA G 

ACAGTTTTGA 

CCTTTACCAA 

GAGTGCTGGG 

ATATTCTGTT 

GTATTCGTCT 

ATACTATCCT 

TTGCCAACTT 

AAATCCATAT 

TATCCGATAC 

TGCATG AAGG 

AATCTGCTTT 

ATCATGATTC 

TTTTGAATGT 

CCCTCTATTA 

GGAAAAACAA 

TATCGTTAAC 

CCTGGAATGC 

CACACTATAA 

TGGCCTATCG 



GTATCAGGTG 

TATGTACGCC 

TCCGACCCCG 

TT CTCGCG CC 

TCCTGTTGTG 

TGGTGCTGAA 

TAATGGCACA 

TGCTGGTCAT 

TATTGATACT 

CGGCTCTGAC 

TCCGTGATGG 

CCAACTTCAG 

ACTGCTCACT 

TAAACAAAGC 

TGGCAGCCGG 

TGAGTAACGT 

ACAGCCGGGA 

AAACCACCCG 

ACCGGATAGA 

CGGTGAATAA 

ATTACATTGA 

ATATCAGCCA 

CCGCTTTGAA 

CAACACCGGA 

GCGTAACGTG 

TTGGACGAAG 

CAGGGAACGT 

ATCCGGTGGA 

GTTGGAGTGC 

AACCTGACAC 

TGGTGTTTGT 

TGGCAGGCAT 

GCGTTACAGC 

AAGAAAGGCC 

TTCTGCCATC 

CAGTAAATAC 

ATATGATGGC 

GTTGGATGAA 

GGCGGTTACT 

TGCATCAGTT 

AGTTGAAAAT 

AAACACCGTT 

TGTTGATGGT 

GCTGGATATT 

TAAAACCCAC 

TGCTATGCCG 

TAATATTGCT 

TAAGATGAAG 

TCTGCGTGAA 

TAATACCCTG 

GACAATGGAA 

TGTTCTGCCT 

CGGGAATGTT 

GTCGGAAACC 

TGTCAGATTG 

CTTTTATTTT 

AGGAATGACG 

TTCTATCGCA 

CTGGGAATGT 

TTCGACGAAG 

GGAGAAATCG 

CAATCCGTTG 

AGTTGCCACC 

CGAACTGACC 



ACGCAGTGGC 

AGAGTTCAGC 

TATTAGTGAA 

GTTTATTGCT 

GACTGATAAC 

AGAGACGCTG 

GTTATCGCTT 

TTCCGATTTT 

CTGTTCTCAC 

ACGCTGGATA 

GGCTG GACAT 

TGTTGGCAGG 

GATGCCGTCG 

CGAGTCGAAT 

ACTG AGTACA 

GTTGTGCAAT 

TGACCTGTAC 

ACTGGCAGAG 

GCCTAATGCC 

CCGTTACAGC 

CCCGACCCAG 

GAGTCAGCTC 

ACCGTGGCAG 

CTGACCTGGT 

CATATATCAC 

ATTGATACAG 

TTGCACCTTA 

AACCTATGAC 

CCCCTGGTCT 

TGAACGGCTG 

GTACAAAACC 

GACCATTTAC 

CAACTGAAAA 

AGCTATCGTT 

GGTGATGATA 

TCCAGCGATA 

AGTGGCAATG 

AGTCCCAGTA 

CTGATCTGGG 

CAAGGCCACT 

AATTATTATG 

TTCACGGTTG 

AATAATTCTC 

GA CACAG GTA 

ACCTTTACGG 

TACACCTTTA 

CCTCTGGATA 

CAAACATTAT 

ACTCATTCGG 

CTGGCTTCTC 

ACCCAGCGGT 

AAATATGACC 

GGCGGTAACA 

AGTATGACAC 

GGGGTTGGAT 

GATGAGACAA 

CAACAGGGGA 

ACGGGCTATT 

TCTATTACAC 

CCACACAATG 

CCCCCTGGAT 

GATGCCATTG 

TTTATGCGCC 

CGCGATGCGT 



TGACTGAGGG 

GGGAATATTT 

GACATGGCAC 

GCAACGCTGC 

CTGCGGCCGG 

AGTGATGAGG 

TCCGTGCAGA 

GTGGTACTGG 

TCTACCGATT 

TGCTGCGCCA 

CAGTATGGTA 

ATATCAACCC 

GTTATCCGTA 

CTGCCTGCCT 

CAACAGGCTC 

TGGTTTCTGG 

AGCTATTTCC 

GCCATTGCCG 

CGTGCCGATG 

ACCTGGGGCG 

CGTATCGGGC 

AGCCGGGACA 

ACCTGAAAGT 

TTGTCGGCCA 

GGATGCAGGC 

CGGTCAACCC 

TCGTGGGTAG 

CGTTTTACTC 

TACGATATCA 

GCGCTGGCCG 

GGGGTGAGTT 

GGCGATGGCT 

ATACCTTTGA 

TCGCGCAGGA 

GTCTGACGGT 

ACCTTGCTAT 

TCATCAGAAA 

CGGCAATGCA 

GGGCCCGATC 

TGATGAACGC 

CCAGATTGTT 

GTAGCAATAA 

AGGGCTTCCA 

TTAACAATAC 

CCAGTGACCA 

AGC CACTGG A 

TCGTTTTTGA 

CGGTGAAACG 

GTGCCCAATA 

AACTGGTATC 

TACCGGAACC 

CTGCTGAACA 

CGGGAAGGCA 

TGTTTGTCCC 

ACCAGAAAAT 

AACAGCAATT 

TCGTGAAAAA 

CCGCCCCGAT 

CCCGATCATG 

GATAAACTAC 

CTGGAACTGC 

ATCCGGATGC 

TGTTGGATCA 

TGAATGAAGC 
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CAAGATGTGG 
CCAACAGTGG 
TCAACAAGAC 
CTCGTTGGTG 
TGCGTTTGCG 
GGCGAATTAC 
AGGGCGGTAG 
AGCGGGCCCG 
CAGAGCATGA 
CGACACAGAG 
TATTGGCAGA 
ACGAGGATAT 
AGTCTCTGGC 
TGTTCGGTTT 
TGATGTCGCT 
CCTACCGCCG 
AACAAATGGA 
TGGAATATCA 
AATTCACAAA 
AGTTCTTTGA 
TGACCGACAA 
TGATGGCGGG 
GTGATGAGCG 
TATCATCAGA 
GCAACGTAGG 
TGCGATTGTC 
AGTTGAAACA 
CGGTGCTGAA 
CCCACGGCGT 
CGTTTGAAGG 
CTGATCGACA 
CCATTCGTTC 
GTTTTTATGC 
GGATCACTAA 
TTTCACTGCC 
ACAGCAGTAC 
TCAGCCTGCG 
CGGATGGGGA 
CAACCTCACT 
GCGTGG CAGA 
AGACGTCTTT 
ATCATGCACG 
AAACCGTCAC 
ACTGTGATGA 
AGTCCACTAT 
TGTTGATAAT 
GAACTCCGTA 
AAAATGGCGT 
CCGTCGCTTG 
GGTTGCAGAA 
CAAGGTTTCC 
GATGATGTCC 
GCAGTCCATG 
ATATGGAGAA 
GTGCTCCGGT 
AACCACTGCC 
ACGGGCGTCT 
CGGAAGGTGA 
CGCAGGCAAA 
CAAATAGTGT 
TTCATTTGTC 
TCAGTGATAT 
GCTACTGGCC 
AAATTACGGG 



TATGTGCGTG 
GCCGCACCGT 
CTTACGGCGC 
GTTTGGTCCT 
CCTGGTTAAC 
GCGAGCCTAC 
TGCAGTGCTG 
CAATCTGGTA 
TGATGCCGAT 
CATCCGTATT 
GAGCCGCCGC 
CAACCACGGA 
CGGGCAGGCG 
CGCTTGTGGC 
TTCTGCCACA 
CCGCCGTCAG 
TGCCCAGCTG 
GGAGACCCAG 
CAAAGCGCTT 
CCTGACCCAG 
CGGTGTTACC 
TGAAACGTTG 
GGCACTGGAA 
CAACTTTAAT 
AGCTTCCGGC 
TGATTTGAAA 
AGTGAGTGTC 
TTACGGCGGC 
GAATGACAGT 
TATTTCCGTG 
GAAAGCGCTG 
TTAATTAAAA 
AGGGTTCAAC 
AAGGAATGGG 
CTTGCCGATC 
TGCTGGCAAT 
TACCGCCAAG 
AGTGTTGAGT 
GTTGGGGACG 
AAAAATCGTT 
TTGGGTACTT 
TATTGCTGAC 
GCATACCGGG 
GCATGAACTT 
GGCAATACTC 
GACTGGTTGT 
CCCGAATTCA 
TGTCGTCCGG 
TGTCGCCAAG 
GAAACACCGG 
TTGCTGCAAA 
CCGCTGGAAA 
CCGCAGTTAG 
GGAATTTCCG 
ACGGGATATC 
ACATATTCCG 
GGATTGGGTG 
ATGGACACCC 
ACTGGCTGAT 
ACGTGTCTGG 
AAATAAGCCA 
GACAGGCTCC 
GAACCTGGGG 
GAAACGTTTA 
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CTTTGGAATT 
CTCTTTCCGT 
TAGACAACGG 
GCCGGAATAT 
CTGCGCCATA 
GATCCGAAAG 
CCCGGCACAT 
GCGCAATTAA 
GAACTCACCA 
CAGCAACGAA 
AGTGCACAAA 
GAACAGCGTG 
CTCTCAGTAG 
GGCAGTCGTT 
GCTTCCCAAT 
GAGTGGGAAA 
GAAAGCCTGA 
CAGGCCCATA 
TACAGTTGGA 
TCCTTCTGCC 
TTTATCCGGG 
CTGCTGAATC 
GTGACCCGTA 
CTGACCGAAA 
AATGAATTAA 
ATTTTCAGCG 
ACCTTGCCGG 
AGCATCGTCA 
GGTCAATTTA 
AATGACAGCG 
CTGGAGAGCC 
CATTGTGATA 
ACCTTTGAAA 
AGAAGCACTC 
TCTGTCCGGC 
GGGTCATTCG 
GGCGTTCCGC 
ATTGTGCCGG 
GTTCTGACAC 
CGTTTAGAAC 
TTTACTGCGG 
CCGCAGGATG 
GAACATATTT 
GCTCAGCATT 
AGCCGGAAAC 
TTCATCTGGT 
ATGTGTCAGA 
ACAGTTTCTC 
TTCTGATGTT 
CGCTGGTTTC 
CGGCCCGCAG 
TGGATTATCA 
AAAAAATGAA 
GCGTTACTTT 
ACTGCCGAAG 
GCACAACAGG 
ATTACGGCAT 
TTTATTCCAT 
ATTGATGGGG 
TCAAATAATC 
CTGCCGGTTC 
GGGCAATCAC 
CATGGAAAAT 
ACCCCCACAG 



GCTGGGTGAT 
GGCGGGCAAC 
AGAAGGTTGC 
AACCCGGAAT 
ATCCTTCCAT 
CGCTGCTCAC 
TGTCGTTATA 
CCCAGTTCGG 
CGTTGCTACT 
CTGTCGATGA 
ATCGTCTGGA 
CGATGTCACT 
CAGAAGGGGT 
GGGGGGCAGC 
ATTCC GCAG A 
TTCAGCGTGA 
AAATACGCGG 
CTCAGGCTCA 
TGCGCGGCAA 
TGATGGCACA 
GTGGGGCCTG 
TGGCAGAAAT 
CCGTCTCGTT 
AACTCACGCA 
AACTCAGTAA 
ATACCCCGGA 
CGCTGGTTGG 
TGCCACGCGG 
TGCTGGATTT 
GTAGCCTGAC 
TGAGCGATAT 
GGCAGGCTCC 
CTTGAAATAC 
AATGCCGTCG 
GTGGTCTGGT 
GGATGGGGTG 
ACTATACGGG 
ACAGCCAAGG 
AGCCGCCTAC 
ACTGGCAGCC 
ATGGTTTAGT 
AAACCAGAAT 
ACTATCACTA 
CAGGTGTTAC 
CGCTTTTTTC 
ATTTGATTAC 
AAACAATGTG 
CCGCTATGAA 
TCAT CAGC TG 
CCGTCTTATT 
ACTGGCCCAT 
ACGTGTTAAT 
CACGTTGCAG 
ATCAGGATAC 
GAACGAATGC 
AAAGCGCGAT 
CAGGGTTACG 
TATCCGCTGT 
CTGGGCTGCC 
CGGCAGGATG 
CCGGCAAAAA 
ATCTGGTGGA 
TTGGTGAGCC 
ACTGTATATG 



GAGCCGGAGG 
CACACTGTGC 
ACTCAACCCC 
CAACCGATTA 
GAOGGGCAAC 
CAGTATGGTA 
CCGCTTCCCG 
CACCTCTCTG 
ACAGCAGGGT 
AGTGGATGCT 
AAAAT ACCAG 
GTTTGATGCG 
GGCTGACTTA 
ACTGCGTGCT 
CAAAATCAGC 
TAATGCTGAC 
CGAAGCAGCA 
GTTAGAGCTG 
GCTGAGTGCT 
GGAAGCGCTG 
GAACGGTACG 
GGAAAAAGTC 
GGCACAGTTC 
ATTCCTGCGT 
CCGCCAGATA 
AAGCTTTGGC 
TCCGTATGAA 
TTGCAGTGCT 
CAACG ATTC C 
GTTGAGTTTC 
CATTCTGCAT 
TGAGGGAGCC 
CGTCATTGCC 
GAGCGGAAGG 
GCCGGTGCTA 
GCAATGTGGG 
ACAAGATGAG 
GCAACCAGAG 
TGTTACCCGC 
ACAGCAGAGA 
GCACCTATTC 
TGCCCGCTGG 
TCGGGCAGAA 
GGCCCACCGT 
GCGGTAAAAT 
GGTGAGCGCT 
TCTGA AAAC A 
TATGGGTTTG 
AAAGCGCTGG 
CTGGATTATG 
GAAACGGACG 
CATGGCGTGA 
CCATACCAAT 
TCAGAAAGCC 
GGTTACCTAT 
GTTGTTGGAC 
GGGCTACCAC 
GCCAATGGAA 
TGACTTAGCG 
GGATCGCGCT 
TAAGCGTCAT 
AGTTACGGCA 
TCTGATGATA 
GTAGACCTAA 



ATTACGGCAG 
AAGCGGGCTA 
GCAACGCTAA 
CTGGCAAACC 
CGTTATCGCT 
CAGCCTTCTC 
GTGATGCTGG 
CTCAGTATGG 
ATGGAACTGG 
GATATTGCTG 
CAGCTGTATG 
GCGGCAGGTC 
GTTCCAAACG 
TCCGCCTCCG 
CGTTCGGAAG 
GGTGAAGTCA 
CAGATGCAGG 
TTACAGCGTA 
ATCTATTACC 
CGCCGCGAGC 
ACTGCGGGTT 
TGGCTGGAGC 
TATCAGGCCT 
GAAGGGAAAG 
GAAGCCTCAG 
AATACCCGTC 
GATATCCGGG 
ATTGCTCTCT 
CGTTATCTGC 
CCGGATGCGA 
ATCCGCTATA 
TGTTTAAGGA 
CTCTGGGGGC 
GGAGCGTCAT 
TCACTGAATT 
GTTGGTTTTA 
TATCTCGGGC 
CAACGCACCG 
TATCAGTCCC 
CGTGAGGAAG 
GGTAAGCATC 
CTGATGGAGG 
GACGATCTTG 
TATCCTGGCA 
CAGGTATCCC 
TATCTTCGCT 
ATGTGTCTGA 
AAATTCGAAC 
CAGGGGAAAA 
ACCTGAACAA 
GTACGCCAGT 
ATCTGAACTG 
TGGTTGATTT 
TGGTGGTACC 
GAGGAGGCGA 
ATCAATGGTG 
ACCATGTCAC 
TATTTCCATC 
CTTATCGGGC 
CAGGATGTTA 
CTTGTCGCAT 
AATAG CGTGC 
ACAGGCTTCC 
ATGGCTCAGG 
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Fig.2. 

15061 CACCACCCGA TTTTATTTAT GCCCGCAATA CTTACCTTGA ACTCTATGCC AATG AAAGCG 

15121 GCAATCATTC TGCTGAACCT CAGCGTATTG ATCTGCGGGA TGGGGTACGT TTTGATGATA 

15181 CTTGTCGGTT ACAAATAGCG GATACACAAG GATTAGGGAC TGCCAGCATT ATTTTGACGA 

15241 TCCCCCATAT GAAGGTGCAG CACTGGCGAT TGGATATGAC CATATTCAAG CCTTGGCTGC 

15301 TGAATGCCGT CAATAACAAT ATGGGAACAG AAACCACGCT GTATTATCGC AGCTCTGCCC 

15361 AGTTCTGGCT GGATGAGAAA TTACAGGCTT CTGAATCCGG GATGACGGTG GTCAGCTACT 

15421 TACCGTTCCC GGTGCATGTG TTGTGGCGCA CGGAAGTGCT GGATGAAATT TCCGGTAACC 

154 Bl GATTGACCAG CCATTATCAT TACTCACATG GTGCCTGGGA TGGTCTGGAA CGGGAGTTTC 

15541 GTGGTTTTGG GCGGGTGACG CAAACTGATA TTGATTCACG GGCGAGTGCG ACACAGGGGA 

15601 CACATGCTGA ACCACCGGCA CCTTCGCGCA CGGTTAATTG GTACGGCACT GGCGTACGGG 

15661 AAGTCGATAT TCTTCTGCCC ACGGAATATT GGCAGGGGGA TCAACAGGCA TTTCCCCATT 

15721 TTACCCCACG CTTTACCCGT TATGACGAAA AATCCGGTGG TGATATGACG GTCACGCCGA 

157 81 GCGAACAGGA AGAATACTGG TTACATCGAG CCTTAAAAGG ACAACGTTTA CGCAGTGAGC 

15841 TGTATGGGGA TGATGATTCT ATACTGGCCG GTACGCCTTA TTCAGTGGAT GAATCCCGCA 

15901 CCCAAGTACG TTTGTTACCG GTGATGGTAT CGGACGTGCC TGCGGTACTG GTTTCGGTGG 

15961 CCGAATCCCG CCAATACCGA TATGAAGGGG TTGTTACCGA TTCCACAGTG CAGCCAAAAG 

16021 ATTGTCCTTA AATATGATGC GTTAGGATTT CCGCAGGACA ATCTTGAGAT T GCCTA TTCG 

16081 AGACGTCCAC AGCCTGAGTT CTCGCCTTAT CCGGATACCC TGCCCG AAAC ACTTTTCACC 

16141 AGCAGTTTCG ACGAACAGCA GATGTTCCTT CGTCTGACAC GCCAGCGTTT TTCTTATCAC 

16201 CATCTGAATC ATGATGATAA TACGTGGATC ACAGGGCTTA TGGATACCTC ACGCAGTGAC 

16261 GCACGTATTT ATCAAGCCGA TAAAGTGCCG GACGGTGGAT TTTCCCTTGA ATGGTTTTCT 

16321 GCCACAGGTG CAGGAGCATT GTTGTTGCCT GATGCCGCAG CCGATTATCT GGGACATCAG 

16381 CGTGTAGCAT ATACCGGTCC AGAAGAGCAA CCCGC TATT C CTCCGCTGGT GGCATACATT 

16441 GAAACCGCAG AGTTTGATGA ACGATCGTTG GCGGCTTTTG AGGAGGTGAT GGATGAGCAG 

16501 GAGTGACAA AACAGCTGAA TGATGCGGGC TGGAATACGG CAAAAGTGCC GTTCAGTGAA 

16561 AAGACAGATT TCCATGTCTG GGTGGGACAA AAGGAATTTA CAGAATATGC CGGTGCAGAC 

1662" 1 GGATTCTATC GGCCATTGGT GCAACGGGAA ACCAAGCTTA CAGGTCAAAC GACAGTGACG 

16681 TGGGATAGCC ATTACTGTGT TATCACCGCA ACAGAGGATG CGGCTGGCCT GCGTATGCAA 

16741 GCGCATTACG ATTATCGATT TATGGTTGCG GATAACACCA CAGATATCAA TGATAACTAT 

16801 CACACCGTGA CGTTTGATGC ACTGGGGACG G7AACCAGCT TCCGTTTCTG GGGGACTGAA 

16861 AACGGTGAAA AACAAGGATA TACCCCTGCG GAAAA7GAAA CTGTCCCCTT TATTGTCC CC 

16921 ACAACGGTGG ATGATGCTCT GGCATTGAAA ZZZZZ ZATAC CTGTTGCAGG GCTGATGGTT 

169B*> TATGCCCCTC TGAGCTGGAT GGTTCAGGCC AGCTTTTCTA ATGATGGGGA GC TTTAT GGA 

17041 GAGCTGAAAC CGGCTGGGAT CATCACTGAA GA7GG77A7C TCCTGTCGCT TGCTTTTCGC 

17101 CGCTGGCATC AAAATAACCC TGCCGCTGCC A73CCAAAGC AAGTCAATTC ACAGAACCCA 

17161 CCCCATGTAC TGAGTGTGAT CACCGACCGC 7A7GA7GCCG ATCCGGAACA ACAATTACGT 

17221 CAAACGTTTA CGTTTAGTGA TGGTTTTGGG CGAAACCTTA CAAACAGCCG TACGCCATGA 

17281 AAGTGGTGAA GCCTGGGTAC CTGATGAGTA TGGAGCCAAT GTGGCTGAAA ATCAAGGCGC 

17341 CCCTGAAACG GGCGATTACA AATTTCCCG7 TGGGCAA7TT CCCGGACGTA CAGAATATTA 

17401 ACGGGAAAAG GCAAAGCCCC TGCGTTACG7 77CAAACCGT ATTCCTGAAA TAATTTGGGC 

17461 AACTATGTCA AGTTGACCAA AAAATGCCCG GCAGGA7A7G TATGCCGATA CCCATTACTA 

17521 TGATCCGTTG GGGCGTCAAT ATCAGGTTA7 CACGCCAAAG GCGGGTTGCG TCGATCCTTA 

17581 TTCACTCCCT GGTTTGTGG7 GAATGAAGTT GAAAATGACA CTCCCGGTGA ATGACAGCA7 

17641 AAAGCTCAGT GATGCCTGTT CACTGAACAG ACA7CAC7CC ATTTAGGAA7 GAA7CATGAA 

17701 GAATTTCGTT CACAGCAATA CGCCATCCG7 CACGGTACTG GACAACCGTG GTCAGACAG7 

17761 ACGCGAAATA GCCTGGTATC GGCACCCCGA TACACC7CAG GTAACCGATG AACGCATCAC 

17821 CGGTTATCAA TATGATGCTC AAGGATCTCT GACTCAGAGT ATTGATCCGC GATTTTATGA 

17881 ACGCCAGCAG ACAGCGAGTG ACAAGAACGC CATTACACCC AATCTTATTC TCTTGTCATC 

17941 ACTCAGTAAG AAGGCATTGC GTACGCAAAG TGTGGATGCC GGAACCCGTG TCGCCCTGCA 

18001 TGATGTTGCC GGGCGTCCCG TTTTAGCTGT CAGCGCCAAT GGCGTTAGCC GAACGTTTCA 

18061 GTATGAAAGT GATAACCTTC CGGGACGATT GCTAACGATT ACCGAGCAGG TAAAAGGAGA 

18121 GAACGCCTGT ATCACGGAGC GATTGATTTG GTCAGGAAAT ACGCCGGCAG AAAAAGGCAA 

18181 TAATTTGGCC GGCCAGTGCG TGGTCCATTA 7GA7CCCACC GGAATGAATC AAACCAACAG 

18241 CATATTGTTA ACCAGCATAC CCTTGTCCAT CACACAGCAA 7TAGTGAAAG ATGACAGCGA 

18301 AGCCGATTGG CACGGTATGG ATGAATTTGG CTGGAAAAAC GCGCTGGCGC CGGAAAGCTT 

18361 CAC7TCTGTC AGCACAACGG ATGCTACCGG CACGGTATTA ACGAGTACAG ATGCTGCCGG 

18421 AAACAAGCAA CGTATCGCCT ATGATGTGGC CGGTCTGCTT CAAGGCAGTT GGTTGGCGC7 

18481 GAAGGGGAAA CAAGAACAAG 7TATCG7GAA A7CCCTGACC TATTCGGCTG CCAGCCAGAA 

18541 GCTACGGGAG GAACATGGTA ACGGGATAGT GACTACATAT ACCTATGAAC CCGAGACGCA 

18601 ACGAGTTATT GGCATAAAAA CAGAACGTCC TTCCGGTCAT GCCGCTGGGG AGAAAATTTT 

18661 ACAAAACCTG CGTTATGAAT ATGATCCTG7 CGGAAATGTG CTGAAATCAA CTAATGATGC 

18721 TGAAATTACC CGCTTTTGGC GCAACCAGAA AATTG7ACCG GAAAATACTT ACACCTATGA 

18781 CAGCCTGTAC CAGCTGGTTT CCGTCACTGG GCGTGAAA7G GCGAATATTG GCCGACAAAA 

18841 AAACCAGTTA CCCATCCCCG CTCTGATTGA 7AACAATACT TATACGAATT ACTC7CGCAC 
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Fig.2. 

TTACGACTAT 
AACTATACAA 
GCGCAAGATC 
GTTCCCGGTC 
AATAGGGAAA 
GTCATTAAGA 
TTGCCAGAGC 
GTCATCACTG 
AAACCGGCGG 
AGCGGGCTGG 
TGGGGGAACC 
TATTCTGGCA 
TCGTGGACAG 
TTCCGAATGT 
CAGGGTGTCC 
GAACACCTGC 
CGAACGTTTG 
AGCG TCGCC T 
GGGGTTTTTC 
TAAACGTTCT 
ACTATTTAAC 
AGGATTAATG 
CGCCGGACAA 
AGGGCACTAT 
GAGCTTGGGTG 
GGTAATCTAT 
AGCTCTGCCA 
AGTGTCGGGA 
GTTGGTGCAG 
GCCAATGCCG 
777AACGCC7 
TG7CATGGAT 
GTCCAGTTAA 
CAGGCTGAGC 
GCCTGTATCG 
CTGGAATTGA 
CATCATTGTA 
GGCAGGTAAC 
ATCGGTAGCA 
GTTCAA ACCG 
TTTATCTTTT 
GGTTCTGGTG 
TAAATTCAGC 
TGGTTTAATA 
TAAACAGAGT 
GAGCCAGTTG 
AGTTTGTGCC 
AGCCAGCGTC 
AATTTCCCAC 
AATACGTGTT 
AAAACGGGAA 
GGCCATATGC 
CGCTTGTTTT 
AGACTGTGCA 
GACCTTATCC 
TTCAGCCAAG 
TATCAACTGG 
CATCACTGCA 
CATGGACGGA 
CAGGTTATGG 
TTTATTAATT 
ATGCAGCGCG 
TGGCAGCTTC 
CACCCAA7AA 
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GATCGTGGGG GAATCTGACC AGAATCGCAT AATTCACGAT CACCGGTAAT 
CGAACATGAC CGTTTCAGAT CACAGCAACC GGGCTGTACT GGAAGAG CTG 
CCACTCAGGT GGATATGTTG TTCACCCCCG GCGGGCATCA GACCCGGCTT 
AGGATCTTTT CTGGACACCC CGTGACGAAT TGCAACAAGT GATATTGGTC 
ATACGACGCC TGATCAGGAA TTCTACCGTT ATGATGCAGA CAGTCAGCGT 
CTCATATTCA GAAGACAGGT AACAGTGAGC AAATACAGCG AACATT ATAT 
TGGAATGGCG CACGACATAT AGCGGCAATA CATTAAAAGA GTTTTTGCAG 
TCGGTGAAGC GGGTCAGGCA CAAGTGCGGG TGCTGCATTG GGAAACAGGC 
ATATCAGCAA TGATCAGCTG CGCTACAGTT ATGGCAACCT GATTGGCAGT 
AATTGGGACA GTGACGGGCA GATCATTAGT CAGGAAGAAT ATTACCCCTA 
GCCGTGTGGG CACCCGAAAT CAGTCAGAAG CTGATTACAC AAGCCGGCGT 
AAGAGCGGGA TGCAACAGGG TTGTATTACT ACGGCTATCG TTATTA TCAA 
GGCGATGGTT GAGTGTAGAT CCTGCCGGTG AGGCCGATGG TCTCAATTTG 
GCAGGAATAA CCCCATCGTT TTTTCTGATT CTGATGGTCG TTTCCCCGGT 
TTGCCTGGAT AGGGAAAAAA GCGTATCGAA AGGCAGTCAA CATCACGACA 
TTGAACAAGG CGCTTCCTTT GATACGTTCT TGAAATTAAA CCGAGGATTG 
TTTTGGGTGT GGGGGTACAA GTCTGGGGGT GAAGCGGCCA CGATTGCAGG 
TGGGGGATCG TCGGGGCTGC CATTGGTGGT T TTGT CTCCG GGGCGGTGAT 
GCGAACAACA TCTCAGAAAA AATTGGGGAA GTTTTAAGTT ATCTGACGCG 
GCTCCTGTTC AGGTAGG CGC TTTTGTTGTC ACATCGCTTG TGACGTCTGC 
AGCTCTTCGA CAGGTACCGC CATTTCCGCA GCAACAGCGG TCACCGTTGG 
GCTTTAGCCG GAGAACATAA CACGGGCATG GCTATCAGTA TTGCCACACC 
AGTACGCTGG ATACGCTCAG GCCCGGTAAT GTCAGCGCGC CAGAGCGGTT 
CAGGCGCAAT TATTGGCGGC ATATTACTTG GCCGCCATCA GGGAAGTTCT 
AACGGGCAGC GATTGGTGCT ATGTATGGTG CTCGATGGGG AAGGATCATT 
GGGATGGCCC TTATCGGTTT ATCGGCAGGT TACTGCTCAG AAGAGGCATT 
TTTCCCACGC TGTCAGTTCC AGGAGCTGGT TTGGCCGAAT GATAGGAGAA 
GAAATATTTC TGAAGTATTA TTACCTTATA GCCGTACACC CGGTGAATGG 
CCATTGGCGG GACAGCCGCG GCCGCTCATC ATG CCGTTG G AGGGGAAGTT 
CTAGCCGGGT TACCTGGAGC GGCTTTAAGC GGGCTTTTAA TAACTTCTTC 
CTGCACGTCA TAATGAATCC GAAGCATAAC AATCATGTTC ATTCCCACTT 
GACAAGGTGG GTTTTTCGGA TGTGTGGACA GAGACCCGTA CAGGGTCTCT 
-Tri TGGATC AAGAACGAA7 GGTGTAACGG ATATGCAAAA TGATATCGCT 
AATAAGCTTT TCTGTTTACC AC7GATACCG GGAAAAC7GA GGG7TAATG7 
GCCACAGGAA GCCCT7CAAA 7GGCAGG7AC 7TAG CATCA 7 7GAAA7CCAT 
CCAC7G7CA7 7CA7GCCA7G 7GAGA7CACA ATCGC77TGC AGCCACGTGG 
CTGCCGCCA7 AACTCAG7A7 7GCCCGGACA 7CC7GA7AAG GCCC7AAAAG 
G7CACACTGA TTTGTTTG A7 ACGGCG7G7A 7TACC7AAAC CG7CAGGATA 
ATATTCAGA7 CCGATAA777 GAGGC7GGCT 7GCAG77GTG TCCCTTCGAC 
7TAAGCGTTG 7GCCTGCAC7 GCC77CACC7 GCA77GAC7A AC7CAG7CAC 
AAAATGAAAC TA7TTTCTG7 CAGACCAGCA 7ACA CTTCAG CCAGAGAAAC 
ACC7CCAG7G CCCGT7CA7C 77777CCAAA TAGCTTTTTT CCA7C7GTGC 
A7CAGGG77T CACCCGC7AA 7AAACCCGCA 7AAG7CCCA7 GCCAAGCACC 
AAGTGTGC7G CCG CA7TA77 CAA77CA7AC TGATAAGTT7 GCTC7GCCA7 
GAGACCGCCA AATCA7AAAA C7GA7AA7AA ATAGCGGACA ACGTTCCACG 
TATAGCGC7G CATTACTGAA TT7AC7TTGC AGAAAGGC7A ACTGCGCC7G 
TGC7GAG777 CCAGA7AG7T 7T77TGTAA7 ACTGCCGCT7 CACGACGTAC 
GC7AA7TGAG CATCAATTTG mTA7C7CA GC7TCCGCA7 TATTGCGCTG 
7CTTGCCGAC GGOGACGGTA 7ATTTCTGAT TGGCTGATT7 TGTCTGCGGC 
GC7GACGCAG AAATT7CGAT ACCAATCGCA CTGGCATTGA AAAGCGCCCC 
CC7CCCACAG CAAAACCG7A AATATTGGGG ACGAGATC7G CCGCGGCGGC 
AGGGCTGTGC CGCTGG7GC7 CAAGACCGA7 GAAGAGAGG7 AAAGA7CCA7 
TCACCAGCG7 7AACA7C77C G7CGTACAGC GTATTGAAAC TGTCAAAACG 
CCATGACGGC TTTCT7GAAG CGCCAAT7TA 7CAGCA7CAA 7T7CAGCCA7 
TGCATTT7AA TACTTTGCAG GGCTAAC7CA CTGCCTTGAG TTTGCAGTA7 
GCTTCTGCA7 CCTGCCGT7C AGTAATGC7G AGCAGGGTAT TGCCAAATTG 
CTTACCCCCC ACTTGGCA7T 77CCAGAA7C ACCGGAAAAC GGTACATCGG 
TGAGGTAAA7 CGCCGCCGCC 77G7GAAGCA G7GA7GGCAG CAC7GAGTAA 
TCTGCGGGCG 7GGCATAGAG AGA7AATGAC AGTGGCTGAC CG7CGA7TG7 
CGTAAGTTA7 AGAGGCGTTG CG7CAATG7C 7GCCAGTAAC C77GCAG777 
TGAGGGAGGA ACAATGCGG7 7AACGAAA7T 7GCCGTACG7 7TCG7GGG7A 
CTGACGCAG7 TGCAGCA7T7 7A7G77GA7A ATGATGCCGC A7TGTTTGGC 
TTCCAGCCGT GGC7C7GACC AATCG77A7C CAATGAAAAA TAAGGCTCA7 
AGTGAGCGCC 7G7ACA7ACC ACA77T7AGC 7TCG77TAAG G7A7CACG77 
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Fig.2. 

CAAGCTGGCG 
TAGCCACTTT 
GCGGATTGGC 
GCATTTCACC 
TCCGGTAAAA 
CAATCCCAAG 
TCATGACACG 
TGTCATTCCC 
AAAATAAAAG 
TAATCTGACA 
GGTTTCAATA 
ATTCCCAATA 
CAACTTAAAT 
CATACTTAAA 
AGTATTTAAC 
ATGCAGTGAT 
AATAACAATA 
AATATCATAG 
TTCGTGTGAA 
GCCATTTTTT 
AAGTTTTATT 
AGTGCCGCAA 
ATAGTATGCC 
AA7TCAGCAT 
GGAATTTTTT 
TCGATGGAAA 
ATTTCATCAT 
AGGTATTTAA 
TTGAACGTTA 
CCGGTAAAAC 
TCT3TAGCAA 
CTCTTAACAA 
ATGGTCATCC 
AAATAAACAA 
TTTTTATTGA 
CTGCCA TCAT 
TCTTCACTTT 
TAAATAACAG 
TCTCCCCAGG 
CAATAATATA 
ACATTAATAC 
GTCAGATAAT 
TGCTGTAACA 
TTTTCCGGAT 
GTTTCCCAGT 

ttattcaXcg 

GTTTTCACTT 
CTCTTAATCT 
GCTTCATCCA 
AA AGTA CTGG 
TAT7TTAATT 
TCCAGCCATT 
ACCGGTCTGT 
AATTGTTCGG 
TCACAACGCA 
TGCA GTGCTG 
TTTTCGCTGA 
GCCACCATGT 
TCA7CGAATG 
GTTTTGGCTG 
TCGTCATCAC 
ATATCCGGCG 
GACCATTTCT 
AAT3CTGTAT 



ATAGGCGCTA 
ATAGTGCATC 
ATTCCAGGAC 
CTGAACCGAA 
TCTGCTCTTG 
AAATAGATTG 
ACTTGAATAC 
GAATCATGAT 
AAAGCAGATT 
CCTTCACGTA 
TCAGATAACA 
TGGATCTTAA 
GTCGCATAAA 
ACATTATCAA 
CGGGTTCTGT 
ATAACGTTAC 
TCGTTATAAC 
CCTTCAATAT 
GGTTTAGATG 
AATAAAAAAC 
ATCTGTGGCT 
TATTTCCCAT 
ATACTCCTGT 
CATCTGATTC 
CCTTGGTTCT 
TAATAAAATC 
GTCTATCATA 
CCTCATCATT 
AATTAATATG 
TGGCTAATTT 
ATTGATTGTT 
TGGCGTCTAA 
CTTCTCTTGC 
TAATGACGTC 
TCAGGTTTTC 
AACGAATATG 
CATTTTTCAG 
GTCTGATATT 
CATTGGCAGC 
TTCTGGGTTC 
TGTCATGATA 
TTTTAAAGCT 
GGTTGTTCAT 
AATAGGCCAG 
CGCAGAAGAA 
CCCGGTTGAC 
GGGCAGAAAC 
GTTGAGGTGC 
GCCATGCCTG 
CGGCTTGCCA 
TTATGAGTGC 
GCAGAGTGAC 
TGCAATGCTT 
CAGTCAACGC 
TG ATCA CAGC 
TGGTTTCTGA 
GTCCAATATT 
TGCTGGTTTC 
TCAGTC CTT G 
AATCCATTTG 
CGAGTGAAAG 
TAAGGACAGT 
GTGTTGTCAG 
CAGAAAAAAG 
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TCTCCGCGGG 
GGATCATGCT 
GTATCTTCCT 
TATCCGGTCG 
CAATAAGCGC 
CATTGGCGCC 
CCCTTTTATA 
CGGCATCATT 
CCCAGGATTT 
TGTAATATCC 
TTCCTTCGTA 
ACCAACGTTC 
ACCCTTCACC 
TACCAATATT 
AAGGGCCAAT 
TTGTATCTTT 
CGCCGTCGGG 
CAACTTTTAC 
CCACATGGTC 
TAATGTTTCT 
GGTTGAACAT 
GTTATTAATG 
GTTATCTTTC 
ATAATCATAA 
TAGATGCATT 
CAAAGTTCCA 
ATCAAATAAA 
ATATATATTG 
ACCATTTCCT 
ATTTTTTGTG 
GAC TTTGTA T 
ATCATTTTCT 
AGGAAGACTA 
TTTTTCATAA 
TATTTTATCA 
TGACAGTTTT 
CTCTTTTTGT 
TTCCTGCCAT 
AAATTGACCG 
TGTCTGGCTA 
TCCGCTAATC 
ATCTTCAACG 
CATACCTGTC 
TTCAGATACG 
CTGACGGGTT 
ATATAACTGA 
TTGGTTATCA 
ACCATTTTTG 
AAGCTGGTCG 
ATCATCAAAT 
AGCAACACCA 
ATCTATAAGT 
GTGTCACAAC 
TCCTAAGTTC 
ATGGAAGCGG 
TTGGAATTTC 
GCGCACAATC 
AT TCTCTG AG 
TGGTTTTATC 
AATGCTGGCA 
TGTTGATAAT 
GCTGTAATTA 
CCACTGGGTG 
GGCAATTTTC 



TAATCAACAA 
GGGCAACGGC 
CCAATGGGCG 
GGTTCAGATA 
TGGAATACCA 
GTTTGAAATC 
TTTTTTGATA 
AGTGAATATA 
GTCATAGATA 
TTTAG CATA G 
ATAAGGTTGT 
ATCACCATGC 
TAATTGCGGC 
GGCTCTTTCA 
CTGCATATAT 
GGATTTTAGT 
TTGCTTAATA 
TTGATTAAAA 
TTCAGCATTT 
ATCTTGGATC 
AAATACACCC 
ATTGAAACAT 
CAATCTAATA 
TTTATACCAA 
AACACTCTAA 
TAATGAAAAA 
A TAACCG TTT 
CC7TTTGAAA 
GGTGATATAT 
GTTATAGATT 
TCTGTCCTGG 
GTGAGAATGG 
TTAAAAGAAT 
TCAGAAGAAC 
G7CACATTAA 
AATATATAAT 
TCCAGCCACA 
ACATTGATGG 
TGCTGGCACT 
TAACCAATTA 
ACCTGCAAGT 
GTATCGATAT 
TGACCAATAC 
CCGGCCCAGG 
TTCACTGGCT 
ATGCTGGCAA 
ATCAGCAGAT 
ATGTAGTAAG 
GATTGTTGAC 
GTTGGCATCG 
TCCGGGGTAA 
TCTCCAGTTG 
CTGAGCATCA 
CAAATGCTGT 
GTCAG CGCTT 
TCCGGTTTTG 
AGAGAAAGTT 
CGATCACGGT 
TGATTAATCC 
GCAATCAGCG 
CCATTACTTA 
TCCGTGGTCA 
CATTGGAACA 
GTGTTCACAT 



ATCCAGCATT 
GTCCGGATCG 
GACGTTCCAG 
TAGCGCAGCC 
TCATGGGCGT 
CATGGGTTCA 

rrrm 'A CTA 

AATTGATTTT 
ATTTTTTTGT 
GGAACAAAGA 
CTGGCAGAAT 
TCCTCTTTAT 
TCTGG TAAA T 
GCTAATTTTC 
TGTGTGCCTG 
TTTATATGAA 
ATAAACTCGC 
TCATATACCA 
AACTCCACTA 
TGTTCGATCA 
ATGGATCCTC 
CATTAGTAAA 
CTATGTTAGT 
CTCCAATTTC 
AATATTCGGC 
CTTCTTCTTC 
CATCTTCTAC 
AATTAATTTC 
ACGAGAGATC 
CCTTATATTC 
TATCAAGTTC 
ATAATG TCAT 
AATTGTCTTT 
AATACATACC 
AATTAAACGG 
CAGTGATATC 
GTAAATACAA 
GTATTTCAAT 
TTTGGTGATC 
AATAAGTGAG 
TAGCGACATC 
TTAACTGACT 
GAATCGTGGG 
TGCTATACCG 
TTGATACTTT 
TGGCTTCTGC 
AGCTGTACAA 
CACTGGCCGC 
TGTTCAGTCC 
GGGTTTCCGG 
TACCCAATGT 
GTAAAGGTAT 
AAATTTTAAC 
TAAGATTCTG 
GCAAAGTGGG 
TCACCAACAG 
GCCCCAGTAC 
TAGCCGCAAT 
ACAGCAAAAT 
GGGCAGCTGC 
GTGTCGTGAT 
TCAGAAACAC 
GAAAGCTGAT 
AGGGAGAAAC 



TTCATAAAGG 
ACCGAATCCA 
TAATAATCCT 
AGCGTGTCGA 
TGTAATAGAA 
GTGTTATTTT 
TCCCCTGTTG 
TCGTCTCATC 
A CCCAA CCCC 
GCGTTACTGT 
TGCCATCAAT 
TGTA GGGGGG 
TTTGCGTTTC 
TGGAAA ATAA 
ATGGCATTTT 
TTGGCGATTC 
TCACCAGAGG 
TAGGGTCAGA 
GAATATCAGA 
TAGATGAAGC 
GCGAAGGAAC 
TGATTCACAT 
ATCAAGTTTG 
TGATTTTCTA 
ATTTTTAAGA 
TTTTCCAAGC 
CATCGATAAC 
CATTGAAGGA 
AAAAATATTT 
GGCCAAATAA 
TGATAATGTG 
ATCAGGGTTA 
TTTCTCATGG 
AATGCTGGCT 
TGAGCTCCAG 
TATCTTGCCA 
ACGAGACTTG 
TTTTTTCCAT 
GACATTGCGC 
CCCCTCATTG 
TTCAAATGCG 
TTGGGAAAGT 
GTCGATATAG 
TCGATTGTAG 
TCCTTCAACA 
CACACGGGTG 
CTCATCCCGG 
TGTCGTCGTG 
CGCCTGCAAC 
TTCACCGACA 
AGCAGCGACA 
TCACTCCCAA 
GCCACCGCCA 
TCGCGTAGCT 
GAGATCATGT 
GGTCAGTTCG 
CTGACAAAAA 
AATCATGAAA 
AGTTTCTGC? 
ACGGATCAGT 
AAGGTTTTCA 
ATCACTCACA 
TAATTGCGTT 
CGACAACAAC 
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Fig.2. 

ATGGATAATT 
AGAGACAGGT 
CATAATGTAT 
AAATGGTTTT 
TGGCCATTTA 
GAATATTGGC 
TGCTGCATAT 
TCATCATAAA 
TACAAACGAA 
TAAAGTGTGC 
TGTTATACCG 
AGCAATGGGG 
AAAATCCAAG 
TTTGATTGGA 
CTTAATCAAT 
ATTCTTATAC 
TTCATCATAA 
ATCATCTGTA 
TACAAGGATT 
GTCCATTAAT 
TAAAATTGCC 
GTAAGTGGAC 
ATGCAGCAGT 
TGTCAGTGCC 
AGGATGAAAA 
GGAAGCAATT 
ACTATTGTTT 
AGCACGGGAC 
TTGATATAAA 
GCTGAAAGAC 
ATCTTATCTA 
TGATTAATGG 
ATTGATACTG 
ATGGATTTAA 
AAATTCAACT 
ATGACTAATA 
GCTACGTTGG 
CAAAAGTTGG 
AACCTCGCAA 
TGATTACTCT 
TTTCCCCGGA 
CCAGCGTTGC 
TTGTTAAACA 
AGGGATCGCT 
AAAACAGAAT 
ATGGGTCTGA 
TGGAACAGTG 
GAATATAGCC 
TGACGAGGAT 
AAGTTAAAAA 
A CGTTG ACGG 
CCTTTAAACT 
ATAACAATTT 
TACACCCATT 
TTTGACATGG 
ATGCAAGGAT 
GAGAAGATTA 
GCAATAGTTA 
ATCTAACAAT 
TTTCGTTTTC 
AACACGCTTT 
GTCCACCTCA 
TGA TTTTT GC 
TTCGCCAAAC 



CATTCACTGT 
TCTCGATGGA 
ACAGTTCAGT 
TTGATTCACC 
ATAGCGGTGT 
TGATATCTGA 
AA TATTG AAC 
TACTTTCTAT 
TCGCTTTATT 
ATTGGC ATTG 
TTGGTGATTT 
ACGAAATTTT 
TGGTCAGGTT 
ATTAGCTCTG 
CTTGCCGTTG 
TGTTGATTTG 
GACAAATCGT 
CCGAAATTTT 
TGATACAATT 
TCCGCTACGT 
TGAC GGGC TG 
AACATTTTCA 
AATTCATTAG 
A GTGAA GCAA 
TCTTTCGCTT 
GATCCCGGTT 
AGGGTTGACG 
AAAATGCGCA 
AAGTCTGTTT 
ATAAACGAAA 
ACAGTTCATT 
TTTTTAGGTG 
ATTTATCGCT 
ATAATGAATA 
CAACAATTGT 
TTTATCTATG 
AGTCAGATAA 
TATATCGTGA 
ATGCGGGGTT 
TATTCTCAGT 
CAGCACAGAC 
TCAAGCAAAA 
GGTTGGTCAC 
TATTACGGAC 
TCAGGGATAA 
CGGCCACTCC 
TTGAAATTGA 
CCGCACTCGC 
CAAAACTAAG 
TCAAAACTTA 
ATGTAAATAT 
ATATTCGGGG 
TGGTTGTGAA 
TTTCTCATCC 
TTAAGCAACT 
TGCCATAGAC 
AATTTGGGAT 
TCTAAAATTA 
TTGATAAATA 
GCGTTGAGGC 
TGTTGCCCTT 
TCCTCATAGA 
CA TTTTTTC A 
GATGCCCGTC 
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CAGATGATGA 
ACACATAAAT 
ATCATTCACT 
GGGGGTTAAA 
ATTGAACAGC 
ATGACACAAT 
ATAAAACAGC 
TAC TTGC CAG 
CAGCTTTAAC 
ATAGCATCCG 
GCTCTSTOGT 
TATCTTGGTA 
CT GTTTTTTT 
CATAGTTTAA 
COGTATCATT 
TATTTTTCTT 
AGTAGCGAGC 
TCTTCATCAG 
CAGGCGATAT 
CTGTATTACG 
ACTCATACGG 
TTACACCGTT 
ATAAGGATAA 
TGTCGGGGCG 
CCCGATATAA 
TTACAAAACG 
GTGTAATATT 
GTTCTTCAAG 
C TCGCCA CGT 
TGTTTGTCAA 
AACTTTTATC 
GAGATTATTA 
CTATTCTTTC 
CCGTATGTTA 
TTAAATATTT 
AAAGATTATT 
ATGTGTGCAA 
CAAGAGTGAT 
TTTCTTCGCA 
ACCCTTTCTT 
T ATACT CAGG 
TCTTTTCGTG 
GAACTCTATG 
TTATCCGGAA 
CAGTGGTTCT 
AACTGACAAA 
TGTCAAAACT 
GGG6T7TTTT 
TTAACGGCAG 
TTTTTTATTT 
ACAGTATTAT 
GAAATTATTA 
TATA AAGCGG 
CCGGTTTTTG 
G CCA CAT AAA 
GTTCAATTTT 
TCTTTGCCAG 
ACGTGATGGT 
AATCTGAGTT 
AATTGGCAAG 
TGAAGCACCA 
AGACCGGGTG 
TCATACTCAG 
CGGCAAAAGT 



ATGTCT GCCA 
TCTGGATTTG 
CTGA AAGCA C 
TCCAGTTTGG 
ATTGTAAAAT 
ACCAGCGCAT 
TTACCCAACA 
ATATCTTCTG 
AGGA ATATA T 
ACGGATTTGG 
CAATTTAATG 
TATATATTCT 
TACACTGAAA 
ATGTGAATCG 
CCCGTCATTG 
ACCGAAGGAG 
CAAAGAAGCA 
TTCTGTTGAA 
ATCAGTCTTA 
GCTAAATGCG 
CAGATGATAG 
ATAGTCAGTT 
TGTGGAAATT 
TCGTTTATTC 
TTCTGTTAAA 
GTGGGCGCGG 
AAGGTTAGTG 
TTTATTCTGT 
CAGAGTTCCA 
TAATAAAGTA 
ATATAAATCC 
TAATCTGATA 
AATAAAAAAT 
AAAATTAAAT 
TTAATTGTGT 
TATTGAGGAT 
AAAGAAATCC 
AGTAATGTCA 
TAATCAAAGA 
TTGGTGCTTT 
GTGGATTTAA 
ATGATGCGTG 
AATTCGCGGC 
AGCTATCTGG 
GTTTATGTTG 
GTTCGTATCG 
ATCCGCATAG 
GCTTTCTGGG 
TGGTCACTGA 
AATA GAGGA A 
AGTCCTTTGA 
TGTCAGATGT 
ATTTATTTAA 
CTGTTGTAAG 
TTGGCAGCAG 
ATTCAACCAC 
CCAAACCCTG 
TTTGGCATTA 
CTTTCTCAAG 
GTAGTGTTTT 
GTCTGCACCG 
TTTCTCTTGA 
GGTCAGGCAA 
AGCGATAGAG 



GCAGACGAAC 
TTCC GCCATT 
GTTTCATTAT 
TATTATCAGC 
GACTGGGTTG 
CGCTGACGCT 
CATTGCTGTC 
GAGATATGCC 
CACCGGGAAC 
TTAACTCGCC 
GGAATACTGT 
TTATCTCCAT 
TTATATTTGT 
TAGAAATCTT 
ACCAATGTTA 
AGATTGACAA 
TAACTCTTAA 
TTTTCOGGTG 
ATAGCCAGTA 
GTGAGGTTTT 
GGTGTCATGC 
TTCTCTAACG 
TCTTCATCCA 
AGGTGATATT 
TAAGCCGCTG 
CCATAAAACC 
ATATTAGCCA 
TTTGATTCCT 
CTTGTCCTAT 
TCACCAGCCT 
TTAAGTTATT 
GGAATATTAT 
AAAGAACTTC 
TTTAACAAAC 
TTGTGCTGTT 
GTCTTGCTTG 
TTAATAAAGT 
CATAATTTAT 
GAAAGCTATG 
GGCACAGCAG 
AGGTCCAACT 
GGTTGTTCTG 
CGCATAATAC 
AACCCCTGTT 
ACATTGATGA 
AAGGTGAAGT 
TGAAATAACT 
AGTC GGAAGT 
TTTGGTGCAT 
TGTCACCCTG 
TATGTTATTA 
TCGTAATATT 
ATAAGTTTTC 
GAAGCGGTTT 
TGGTTTCGTG 
GGGCAATAGG 
ACCTTCCGGC 
ACATATTGAT 
CTACCGACAT 
TGGTTCTTTC 
ATTTTCGGGT 
GGCAT TGGAT 
TTTTACGGTT 
GGTACTTTGA 



GCGATAAAGC 
AGCCAGTTTC 
TCCCAAATAA 
AGAAAACTCT 
TTGTTTAGTG 
AATATTATAG 
AATGG TTAA G 
TGTGGCTTTA 
TCCATCATTT 
ATAAGCGGAG 
AATGGGTATT 
TCTGG AGACG 
ATTCATTTTC 
TGCGGGTTCG 
TCAGTTGCTC 
ATAAACTGAG 
AAATCAGTAC 
TAATTTCTTC 
GCGATGTTGG 
TATCTTGCAA 
CGGTTTGCCG 
TCTGAATATT 
TATTATTCTG 
GAGAATTGTC 
GTGAAAATAT 
AACTGTTGTA 
GTTGTGGATT 
GATGAGCCTG 
GACGAAATTC 
TTTTCTATTT 
GTCAATTTAA 
GGTTAATTAA 
CCTATAATAC 
TTTCATGAAA 
TGAAAAATGA 
GTTTCAGGGG 
TGCGTAATTA 
TGAATACCCG 
AAAAAAACAC 
GGTGGCTTCG 
CCCAACCTGA 
GAAGGAAACA 
GACTCACTAT 
ACGCCTGAAT 
TAAGCGCTGG 
GGACAAAGAC 
CAAGCACTTT 
TTAACCGTAG 
AAGTTATCAA 
TAGGTGAATA 
AATTGAAAAA 
ATTAATGTTG 
ATAATTGTGA 
CCATGAAGAT 
TCACGGTTTC 
TCGGTAAAAA 
TCTTATGAAT 
TGTTAATT7C 
AAGTGATTTC 
CGGGGGTAAC 
TCAGGTTGAT 
AACGTCTCGC 
GGTGCCGCCC 
GAGAGCGATG 
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Fig.2. 

TATTCAGTAG 
AGCCAAAATG 
AGTTCCAGAT 
TTAACCAATT 
GAGAAACCGT 
TATCCCGGGT 
TTATGATTGG 
CGCGAAAATC 
GGAGTCAGCA 
TGATTCATCT 
TGGCCATTAT 
GTTTCAACAT 
AATTATGTCT 
CGGGAACCTT 
AATTAAGTTT 
TTTATGGTTT 
TTATATAGTA 
TAACAGAAAA 
GAGCTGTATT 
GTCAGCAGAT 
AGTAAAGGCG 
AGCCTCACCA 
TTTTATCCCT 
GGAACCGGCA 
GTCGAGACAA 
GATCCGTTAA 
AAAAAAGGAC 
AGGCCTTACC 
CACCTGTATA 
CTGACACATA 
ATGCTGGAGT 
CAGCGCCATG 
TTTGTCCATC 
GAACAGCTTG 
GCTGAAGGAC 
AGTCTGGACA 
C ATTAA ATGG 
ATTTTATTTA 
TATATAACGG 
CGTGAGCAGC 
CCAATTTCAT 
AAATCCAGTG 
CCAGTTCAAA 
GCAGACAGAA 
TATAGCGGCG 
GGTGAATTTT 
AGTAATCCAT 
TGGGGGGATA 
CGTAGGTCAT 
CCATTTTGGC 
AGGCGATGTT 
GTTCCGGGTC 
CCACCGCAGT 
GCACGTTGCT 
ATACTGAATG 
AATCCCCCCA 
CTGGCATCAC 
TAAG GATC AA 
TCTGCTTGCC 
TCACCGTTGA 
TCCAGATCAA 
TCTGGCGACA 
AGCGGTTCTG 
GC7GAAGAAT 



CTCATTGATT 

TTGTGGCGAG 

GGCAGGCCTT 

TACCCAACGA 

ACTCCCTTCA 

TTTCTGGATA 

CATGACTCAG 

GGACTGAGTT 

AATGAGTTAT 

ACCGGTGGTA 

ATCAAAGTTA 

GGCAGTTATG 

GGTGATTCAG 

CCACTGATGG 

ATATTTCATC 

TATATTTAAT 

AATAAATTCT 

TTCATGGTTA 

TACTGTAGAA 

AATATGTTGT 

ATTAATAACC 

TGACGCGTTA 

T TATCT GCCG 

GCTTTGTGGA 

CCCACGGGGA 

TGGCCTGGCG 

ATACTGAACT 

CTTACTCAAA 

ATCAGCCCCT 

AAAGCATTGC 

GGGTTCCCCA 

TTGTGTTAAG 

AACTGACTGA 

AACAAAAAGG 

GGGAAGAAGG 

TCATTGTCAC 

ATACGCTTTT 

CTACGATTTA 

TCCCATATCA 

ATTTGCCAAC 

TTTGGTTGCA 

ACCACCGTCA 

CTGATTTTTC 

GAATTCCGGA 

CGTTGTCAGA 

CTCCGGTTGT 

GTCCCGATCA 

CAGGTTAGTA 

CACAAAGATA 

AACGACGGCG 

CAGTGCTTCA 

GAATTCATTA 

AAACATGGGA 

CAGGATCTTT 

CGAGTTCCAG 

GTAAACCGGA 

GGCGCTGATC 

CGGGTACAAT 

GGTTCCACCC 

CGGCCATAAA 

AACCACGGCC 

AACGCGCATC 

TATTTTCCGG 

AACTCAAAGG 
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TTAAGTGTAA 

TGCTGTAATA 

CCCGCCGGGA 

TGAACGGAAG 

TGTAACATCA 

GCTTTTTTCA 

TCCATTTTGG 

CCCTTCAAGT 

TCCCCATAAT 

TGTGGATTCC 

CTTTCAGTAA 

TTTATTTTAC 

CTAAAGGCAA 

TATTGAATAA 

TGGTTTCTGC 

GCCAATCATA 

GTTGGATGTG 

GGAAATTCAA 

CTCGCATTGA 

ATATTGGCTG 

GATAAAACAG 

TTCAAACATT 

GAAGCGATCC 

CAGGCAATTA 

CGGTTACATT 

GCTGATGTAT 

CCCTTTGGTC 

TCGATGGCTG 

GCCGTTGGTG 

CTTGATGGAG 

ATTGGTGGCG 

CTATATTTTA 

ACAATCTCCG 

GCGTGAGCAA 

CAAGCTGGAA 

CAGTACCGGC 

TCACAGCAGG 

CGACGGGTTA 

ATCTTCTCTT 

AGGCCATCAT 

TAAATTCCCT 

GCATTAAAGA 

CCGCGTGCAA 

GCACCTTTTT 

TCAGCACCCA 

ACACCTTGTG 

GGATTGGGCG 

TGGTGACCGA 

TTGTCTAAAT 

CTACAGGCTA 

CGCAGCTCTT 

CCTTCTTCAC 

AAACGCCGGG 

GGCCATCACA 

CTTATGCCCT 

GGCTGCATCC 

CGCGTCCAGA 

ATGGCCTAAT 

GTCAACAACC 

ACTGAAAATC 

GGGGGCATCG 

ATACTGGCAC 

ATCAACTTCA 

AGTTCCGCTG 



TAAGCTCAAG 
AGAAAGAAAT 
GGCTTTTAAG 
AACGTGAACA 
AGAGCGCGGT 
TCGGACGTCG 
GATTTCTTTT 
GATCTACTAT 
ACCTGACCAT 
TTGGTGCGAT 
AAAGGACGCT 
AGACAGTAGT 
AGGGGATTAT 
ATATTGATTC 
AAT TAAGTT T 
TTATTTTTCT 
ATTATTATTG 
TCAACTTTTG 
TACT GGATT G 
TGGATTTTTC 
AGAGACGGAT 
TTTTAACCCA 
GGTCAGTGTG 
CGTCAGTTGC 
TATTGCCTGA 
TATTCGCTGT 
GTCCC CCTG C 
GATTGTTTTA 
GATATCAGTG 
CTGGTACAAA 
TTG TTGAATG 
CTG AATGG A C 
GAAA 
GG C CGGACAG 
GCGCG 
CTG AG CCGGG 
ATA TGG TG A C 
CTTTAGGAAG 
TTCCGCGTAC 
CCTGATCGCC 
TATGCAGCAC 
GTGCGTCAGC 
TTTCATATTC 
CCATCGTGCC 
GACATGAACG 
ACAGTAAAAA 
GAGGAGGGTT 
TGTATTCTGC 
AAGGTGCGAT 
TCGTGATTTC 
TCACTAACAA 
CTGTGGCGCC 
AAGAAGTCGA 
GAGAA ATAC C 
GCCTGTTTTG 
TGATTGTAAT 
CCGACATTGC 
GTAATAGGGG 
TCATTAATCC 
AGGCGGTCGT 
TCGCTGGTCA 
CAGTCAGTAA 
TATTCGTTGT 
CCGTCAGGTT 



GCTCCATCGT 

GACTGTGAAG 

TCCTTCCAAC 

GTGAAGCGTT 

GAAGCGACGT 

TTCATTTCGG 

GATTTGGCGA 

TTT GAAATCT 

GTGGTTGTTT 

AGTCAGAAAG 

GCTGATATTG 

AAACAATTTA 

AAGCTTGAAA 

ATTATTATTT 

TAAAAATTAA 

TATAATAATT 

TGAG ACGGT A 

TCCGGTTTCC 

ATTAGCCGGA 

AGCGAGATGA 

TGTGGCCAGG 

ACCAGAAACC 

TGATTTACCA 

ACAGTGATGT 

TTGAACACCA 

CAGCCATGGC 

TGTTTTATCA 

CACTCTCTGA 

CGCTCAGTGA 

AACATATCCG 

CCGGTTATAA 

ATACGCTGGA 

CCATGTTGAT 

AAGGCAGAAC 

CATTATTACG 

AGAAAATTGA 

CCCTGTGAGG 

CTGAATGAGA 

AGGTAAGTAA 

TGACCAAGAG 

AGTGCGGGGC 

GTCGGTTTCC 

CGCATCGTAT 

CAGTGGCTCT 

TCCATAGTTA 

GCGGATCGCC 

ATCGCCGTCA 

CC AACCG GTA 

TTCTTTGAAG 

TTTACGGGCC 

AACATAGTTT 

GGGGTATTCC 

CGATGCTACT 

CTGACATACT 

CTCGCGCTTT 

ATTGCAAGAA 

GTGTGGTGCC 

CAATCTGGCC 

GTTCGGATAA 

AGGCGGTAGG 

GCGCAGTGTT 

TATAGGCAGA 

ACAGGGACTT 

TATATCCCAC 



GAACGGAGAT 
AGCGGAGCTA 
CCGTATAATG 
CTGGAAACGT 
GCATAGTCCT 
GGTATTGATG 
TTAATCAGAT 
TATTTAATCA 
ATCCGGGAAA 
ATATTGACTC 
TGAACTACAT 
GCAATAAGCA 
TTAAAACAAA 
ATGGA TAAGA 
TTCTACTTTT 
GATAGTTTAT 
ATAATTAACA 
TGACCATGAA 
CGA GTGTT GG 
TAGCTTTGGC 
AAAGCAAAAA 
GCCCGGGAAT 
CACTAAAACT 
GCTGTATTCT 
GTCCACGCCT 
TGCGCATCTG 
TGGTGAGGTG 
ACACGCGGCT 
TGAAGAGATC 
TTG CCGGG AT 
TAGCGCCGAA 
TCTCGCCCAG 
GACTATTGCA 
AGAAGGCAGA 
GCATGGTGTC 
AGCGTTAAAG 
CCAC CGGAA A 
CGTCCTTTGT 
CCCAAACCTT 
AAGATCCCGC 
GTATCCAGTG 
GTGTCTGTCA 
TGGTTATTCA 
CCTGTTCTGT 
GCAAATCCGA 
TCATCTGCCG 
TATTCATATC 
CCAAAGAAGT 
CTGGACTTCT 
CGGGTTCCAA 
GGGCCATCAT 
CAGTCGATAT 
CACAAATGTA 
CCAGCCGCCG 
CAGATTACGC 
ATTCTTCGGG 
TAAATCACCA 
ACTGC TGGCT 
CTTGCCTTTG 
CGGGATTTTT 
ATCCTGGGTT 
GACTTTAGGC 
GGCAACACGT 
CTTCTGATAG 
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Fig.2. 

GTTTCTTCTG 
GGCGTATTGG 
CAGGCATTCT 
TCAACCCAGC 
C CAGAT TCAT 
GCTTTAATTG 
TATTCCGGGG 
AGACGGCTAC 
TAACGCAGAT 
AATTCCAACT 
TCAAAATTGA 
TTAGCCTGTA 
TAATCGATAG 
GCCCGTACAT 
CGCCGATCAG 
AATGAACCCG 
ATGGTTTTTA 
TGTAAGGCTA 
CCACTTTGGA 
AGTAAACAGA 
TCATCCACAT 
CCGAAGCATA 
TCCGCCACAG 
GATTGTAATG 
TAGTGTTGCT 
TGTTGCATGT 
TGCTGCATAC 
GTGCTGCCGT 
AAACGATAAT 
TGGTAACGCT 
AACTCTTTAC 
TGCCACAGTT 
GAGACGGATG 
TCCTGTGCCC 
CAGTTTGCCG 
ATCTGCTTGG 
TCTAACAGAT 
GCTGCAATGG 
TCAATAATTG 
TCATCTTCCA 
TCTATATTCA 
ATCCCATTTG 
GTAATAAGCT 
A GTTTTTT T G 
GCGGGGAAGG 
7GCATACCGA 
GGCTTAAAGA 
GGATAGGTTA 
TTATCGTGCA 
TTAAA GTCA G 
ATCAGTTTGA 
TAGGAAAGCG 
TGCCTTAAGG 
CCTTTTCGTC 
TCGCAGACAT 
TGAAGACGGA 
ATCACAGCAA 
TACTCCAGCT 
AG GAAC CCAA 
ATCTTTAGTT 
GAAAATGGGG 
CTCGCTCCAG 
TGAACGCCAG 
GCGTTTGATG 



TGAGTGCATC 
GGTTACCGTG 
CATAAACCGG 
CGATGTTTTT 
TCTGCCAGGC 
AAGGGTATCG 
CCGGCTCCTG 
GAACCTGCTG 
GGAG GATAA T 
TCCACTGTGA 
GCCTGAACAT 
CGTTCAGCAA 
TTTTTAAGTC 
CTTCATAAGG 
CGTGGGTCGG 
TACTCTTGCC 
TGATCTCCAG 
ATTTCAGATG 
CGAAATTCAT 
GGGAATTAAC 
GTATAACGCA 
CCGCCAAGAC 
CGGCCGCAGT 
CGATAACTTC 
GGCGCTCCTG 
CAATCGCTTG 
GGAAGTGTTC 
ACTGAATCAG 
TCGGGATTGC 
GCCTGAGCAG 
CATCCAAGGT 
TGAGATATTC 
TATCATATGT 
GTGCTTCACT 
CGAGCAGAAT 
CGACAGTACG 
TCTTGACATA 
CATCCGGATC 
GCCGTGAATT 
GCGGATTAAA 
GTCGCCACGC 
CGCCATTTAA 
GTGTATTCCA 
CAAACAGTGT 
AAAGGAATTG 
TTCTGAACTC 
GAAGTTCGGC 
AATCAAGAAC 
TCAGCCGGAA 
ATTGAGCGAC 
TACTGTCTTT 
TTCTTCCCGG 
CATCCTTGTG 
CATCAGCATA 
AACCGAAGGC 
CAAACGGAAC 
CCATCTGGGC 
GCCATCATAT 
TCACCCGCAC 
TCAGACTGTT 
CGTATCAAAT 
GCATTGGGAG 
TAATATTGGT 
TTAACACCAT 
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ATATTGCAAT 
ATCGGCAATT 
TAAATCAGGT 
AAAAACCGCG 
AACCAGAGAT 
ATAAACATTT 
ATATCAGTTA 
GCTGAACTCA 
ATCGCTCAGC 
AATGGCGCCT 
GCCGCTGTCT 
AACGTTTTCG 
AGCAGTACTG 
CCCCAGCAAT 
ATAATCGCGC 
AATTTCCCAC 
ACGTCTGGTG 

rrcrccGACc 

ATCATACTGA 
CGCATCATAG 
TCATTGGTAN 
CATCCCCCCG 
GGCGGCTGAC 
CTGCTCGGTG 
AGCGGCCCGT 
CTGTTGCAGA 
AAAATC GGTA 
CGTTTCTGCG 
CCGGCGTTTC 
ATCTTGCGGG 
CAGGTTATGA 
CGTATCAACA 
CACAGGCAGA 
GTTTTCATAC 
ATCAGGGCTG 
TGCACTGGAT 
GAAACGGAAT 
GGTTGGTTCA 
CCAGTAACGC 
AATATAGTGC 
GACCAGAAAT 
ATCAATCGGC 
GCTCAGTACC 
ATTAAGGCGA 
CACCTGATCC 
TTCAGTACAG 
TTTCAG GGTG 
TTTTTCGCTC 
AGAACCGTTG 
AATCTCCAGT 
GCCGAAATCA 
GTTGCCGGAT 
TTCACGTGAG 
TTGGTCATCC 
TTCGTCATAA 
CAGAGCCAGA 
CATCCGGTAT 
TTGGCATAAG 
TAGGCTCAAC 
CTTCAACTTC 
TGGGGTCTAC 
ATAACGCATC 
ATGGCTGTGT 
CTTCATAACC 



ACCTCGGTTT 
TCTTCCGGTG 
GAAATATTGC 
CTATCATAAA 
GCGCCTACTT 
TGAGACATAA 
GAATTGTCTT 
TTACTTCCGC 
GACTCCAGCA 
GTCCCTTCAA 
TCCATGGCCG 
GGTTTGGTGT 
TAAAGCGTAT 
GCGGGCAATG 
AAGAACATTT 
TGTGATGATG 
TTATGTTGCA 
AGCAGCCCCT 
CCTGTTTCGT 
GCTTGCAGGT 
ANTTGTTCNN 
ACGGCCAGAC 
TGGGCAGCGA 
ATGGAGATGT 
CGGCTGATGG 
TTGCGGGTAA 
TTGT CTTTTT 
GCCTCTTTTG 
ATGCCCGCCA 
CTGATGGGTT 
CGTAAGTTAT 
GGTTTGACAA 
AGTGGCACGT 
AGAGCCACAT 
GTACCCAGTA 
GTCA GCTT AC 
ATTGCTTTCC 
ATTAACATCC 
GGTTTACCTT 
AGCCATTCGG 
GGCATATGGA 
GTAGGGAATG 
TGCGGGATAC 
ATGTTTTGTG 
TGTTCA TTGA 
CTGGCACTTT 
ATTCGATTAT 
AGTACCAGTG 
TAATATTGAT 
GTGTCATCAG 
GGGTTCATTC 
AGAGCACCAT 
TAATACCAGA 
GGCAAATCAG 
TCATAATCCT 
AACGGGTTAT 
TG CAGATG TC 
CGATTTTGAT 
GTTTTGGTTA 
CGTCCAGGCA 
GCTGCCCAAT 
GGTATCAGGA 
ACGGGTACGT 
TGCGATAACT 



TTTCTCCCGG 
TCGCCTCACG 
GGTCGGGAAT 
TGACATACCA 
CGCTGCTGGC 
T TTCAC TTCC 
GTTTTAATTG 
CACTCACATC 
GCTGATCCTG 
AAGGCAGGAA 
TTGAAATCAC 
ATTCCAAGGG 
TGCTGAGTTG 
ACAGCGCTAC 
CGGCGCTCAG 
TCAGTAATGA 
AATACGCCTG 
GATAAAGATC 
ACTGCCAGGA 
AAAGCCGGAG 
NNNNNNNNNN 
CGAAAATATT 
TCACACCTTC 
TTTCATCATA 
TCAGTGCATC 
AGCTGTACAG 
TCTCCAGCAA 
CCCGGCTCAT 
TAOGATTAGC 
CATCGTATAA 
ATAGACGCTG 
ATAAATCAGA 
TGCTGACAGT 
CTTGCAGCGT 
ACATATTGAC 
GGTATTCCAT 
GGTAGTGAAT 
GGTACACGGT 
GGTTGCTGGC 
TGGCCTCTTT 
AAAACAGTTC 
AACCGGGTAT 
CCTGACTGGC 
GCGCGTTATC 
GTTTAATCAG 
CATTGCCAAC 
CCGACCCCAG 
GTTGTTCATC 
GATCTTCTAT 
TGCCATGAAC 
CGGTTTGGAT 
AGTACGGTAA 
CCAGGTTGCC 
TAATTTCTAC 
TACCTTTCTT 
GCGGGTCTTG 
TTCGCGCAGA 
CCGGTCAGGA 
TGCAGTGATA 
ATATACAGGC 
GGCAGGTCAA 
TGGCGTATCG 
CCGACAAAGA 
TTCAGGTTAC 



CGGTACATCA 
GACATATTGC 
A TGCCA GCGT 
GGTTTGACCA 
GTCAGACATC 
G GCCCC GTTA 
ATGTTTATTC 
ACGCGCGGTA 
ATCGGAACCG 
AAGTTCATCA 
CACACCTTGA 
GTTAAGCAAA 
TACCAGTGAA 
GGTTTTTATA 
TAAGA AAGTG 
TTTTACCGAT 
ATCCATCCGT 
ATTCCAGAGA 
G GCTT CGGCC 
ATTTGGCTGA 
NNNNNNNNNC 
GGGAACCATA 
AGCCGCTCTT 
GAGCGATTTA 
CAATGAAGCC 
CCCCAGTTGC 
ACTCAGTAAC 
GATCGGGGTG 
CACAACACGC 
TCCGGCCGGA 
ATCCAACATT 
CGGTGCGGCA 
AAGCATTAAC 
ACGGGGTTGC 
GGAGTCATAG 
GTCTCCCTGA 
GGGTTCACTG 
GGGTGGAGGA 
CTGAACAAGT 
TAATCGTTGT 
CCAGAAATAG 
AGGCTGTTCG 
AATGGCGATC 
AGTTTCATCT 
TTCGCGAATA 
ACCACCTTTG 
CTTGATTGAT 
CAAGACAGTA 
CGCACCAAAC 
AAAATTGACA 
TCTCCGGCAA 
TCGATAGGAT 
GACATATTTT 
CAGCAGTGTA 
ATCTGTCCCC 
CTGTATATCC 
ATGGTGGGTG 
ACGGTGTGGG 
ACGCAGTTGT 
GATTATTCAG 
TAGG7TTCCA 
AAAGATTCAG 
AGAACTTATC 
TGACATCTTC 
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Fig.2. 



3B101 AAAATTATTC AGATAACCGA GCACCGCTTG TTGTACAGAA TCTTCGGTJA TTTTTCCCTG 
38161 ATTAAGGGCA CTTTCCAGTT GGAAGAAGAA TTCTGTTTTA TTCAGGCGTA ACAGGGGTTC 
38221 CAGATAGCTT TCCGGATAAG TCCGTAATAA GCGATCCC 
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